Commit message (Collapse) | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
* | clean up harvester comments/docs | Bryan Newbold | 2018-11-21 | 3 | -50/+31 | |
| | ||||||
* | crossref importer doesn't require author/title attributes | Bryan Newbold | 2018-11-21 | 1 | -6/+6 | |
| | ||||||
* | crossref importer checks for existing DOIs | Bryan Newbold | 2018-11-21 | 2 | -4/+19 | |
| | ||||||
* | use isoformat() to format dates | Bryan Newbold | 2018-11-21 | 3 | -5/+6 | |
| | | | | This shouldn't change behavior; it's just more consistent. | |||||
* | grobid importer: release_date as a date | Bryan Newbold | 2018-11-21 | 1 | -1/+1 | |
| | ||||||
* | fix loop_sleep typo | Bryan Newbold | 2018-11-21 | 2 | -2/+2 | |
| | ||||||
* | fix datacite DOI extraction | Bryan Newbold | 2018-11-21 | 1 | -1/+1 | |
| | ||||||
* | fix OAI-PMH name/finished message | Bryan Newbold | 2018-11-21 | 1 | -1/+6 | |
| | ||||||
* | fix oai-pmh issue again | Bryan Newbold | 2018-11-21 | 1 | -13/+14 | |
| | ||||||
* | oaipmh: handle NoRecordsMatch | Bryan Newbold | 2018-11-21 | 1 | -5/+8 | |
| | ||||||
* | start supporting kafka importers | Bryan Newbold | 2018-11-19 | 2 | -1/+18 | |
| | | | | A nice feature would be some/any log output as to progress. | |||||
* | fix some broken importer args | Bryan Newbold | 2018-11-19 | 1 | -5/+7 | |
| | ||||||
* | monograph isn't a CSL type | Bryan Newbold | 2018-11-19 | 1 | -1/+1 | |
| | ||||||
* | not as strong a todo (timestamps) | Bryan Newbold | 2018-11-19 | 1 | -1/+1 | |
| | ||||||
* | initial OAI-PMH harvesters | Bryan Newbold | 2018-11-19 | 3 | -5/+167 | |
| | ||||||
* | better DOI registrar harvesters | Bryan Newbold | 2018-11-19 | 3 | -48/+145 | |
| | ||||||
* | bunch of pylint cleanup | Bryan Newbold | 2018-11-15 | 6 | -24/+38 | |
| | ||||||
* | large refactor of python names/paths | Bryan Newbold | 2018-11-15 | 13 | -30/+78 | |
| | | | | | | | - Add __init__.py files for fatcat_tools submodules, and use them in imports - Add a bunch of comments to files. - rename a number of classes and functions to be less verbose | |||||
* | have recent message helper cleanup consumer | Bryan Newbold | 2018-11-15 | 1 | -1/+5 | |
| | ||||||
* | refactoring harvesters | Bryan Newbold | 2018-11-15 | 5 | -196/+210 | |
| | ||||||
* | initial work on metadata harvest bots | Bryan Newbold | 2018-11-14 | 4 | -0/+197 | |
| | ||||||
* | fix worker code | Bryan Newbold | 2018-11-14 | 2 | -2/+5 | |
| | ||||||
* | most_recent_message as reusable function | Bryan Newbold | 2018-11-14 | 2 | -26/+26 | |
| | ||||||
* | update crossref controlled vocab | Bryan Newbold | 2018-11-14 | 2 | -3/+32 | |
| | ||||||
* | python tweaks for date/datetime rust fix | Bryan Newbold | 2018-11-14 | 2 | -10/+3 | |
| | ||||||
* | switch to auto consumer offset updates | Bryan Newbold | 2018-11-13 | 2 | -2/+11 | |
| | | | | | | This is the classic/correct way to do consumer group updates for higher throughput, when "at least once" semantics are acceptible (as they are here; double processing should be safe/fine). | |||||
* | to_elastic_dict -> release_elastic_dict | Bryan Newbold | 2018-11-13 | 1 | -1/+2 | |
| | ||||||
* | use Counter object instead of per-metric ints | Bryan Newbold | 2018-11-13 | 6 | -17/+17 | |
| | ||||||
* | more simple fatcat_client imports | Bryan Newbold | 2018-11-13 | 2 | -3/+2 | |
| | ||||||
* | shuffle around fatcat_tools layout | Bryan Newbold | 2018-11-13 | 10 | -73/+7 | |
| | ||||||
* | more python module refactoring | Bryan Newbold | 2018-11-12 | 8 | -8/+8 | |
| | ||||||
* | refactor python modules | Bryan Newbold | 2018-11-12 | 12 | -0/+1243 | |