| Commit message (Collapse) | Author | Age | Files | Lines | |
|---|---|---|---|---|---|
| * | fix file extraction (and transforms) | Bryan Newbold | 2018-11-26 | 1 | -6/+6 | 
| | | |||||
| * | clean up harvester comments/docs | Bryan Newbold | 2018-11-21 | 3 | -50/+31 | 
| | | |||||
| * | crossref importer doesn't require author/title attributes | Bryan Newbold | 2018-11-21 | 1 | -6/+6 | 
| | | |||||
| * | crossref importer checks for existing DOIs | Bryan Newbold | 2018-11-21 | 2 | -4/+19 | 
| | | |||||
| * | use isoformat() to format dates | Bryan Newbold | 2018-11-21 | 3 | -5/+6 | 
| | | | | | This shouldn't change behavior; it's just more consistent. | ||||
| * | grobid importer: release_date as a date | Bryan Newbold | 2018-11-21 | 1 | -1/+1 | 
| | | |||||
| * | fix loop_sleep typo | Bryan Newbold | 2018-11-21 | 2 | -2/+2 | 
| | | |||||
| * | fix datacite DOI extraction | Bryan Newbold | 2018-11-21 | 1 | -1/+1 | 
| | | |||||
| * | fix OAI-PMH name/finished message | Bryan Newbold | 2018-11-21 | 1 | -1/+6 | 
| | | |||||
| * | fix oai-pmh issue again | Bryan Newbold | 2018-11-21 | 1 | -13/+14 | 
| | | |||||
| * | oaipmh: handle NoRecordsMatch | Bryan Newbold | 2018-11-21 | 1 | -5/+8 | 
| | | |||||
| * | start supporting kafka importers | Bryan Newbold | 2018-11-19 | 2 | -1/+18 | 
| | | | | | A nice feature would be some/any log output as to progress. | ||||
| * | fix some broken importer args | Bryan Newbold | 2018-11-19 | 1 | -5/+7 | 
| | | |||||
| * | monograph isn't a CSL type | Bryan Newbold | 2018-11-19 | 1 | -1/+1 | 
| | | |||||
| * | not as strong a todo (timestamps) | Bryan Newbold | 2018-11-19 | 1 | -1/+1 | 
| | | |||||
| * | initial OAI-PMH harvesters | Bryan Newbold | 2018-11-19 | 3 | -5/+167 | 
| | | |||||
| * | better DOI registrar harvesters | Bryan Newbold | 2018-11-19 | 3 | -48/+145 | 
| | | |||||
| * | bunch of pylint cleanup | Bryan Newbold | 2018-11-15 | 6 | -24/+38 | 
| | | |||||
| * | large refactor of python names/paths | Bryan Newbold | 2018-11-15 | 13 | -30/+78 | 
| | | | | | | | | - Add __init__.py files for fatcat_tools submodules, and use them in imports - Add a bunch of comments to files. - rename a number of classes and functions to be less verbose | ||||
| * | have recent message helper cleanup consumer | Bryan Newbold | 2018-11-15 | 1 | -1/+5 | 
| | | |||||
| * | refactoring harvesters | Bryan Newbold | 2018-11-15 | 5 | -196/+210 | 
| | | |||||
| * | initial work on metadata harvest bots | Bryan Newbold | 2018-11-14 | 4 | -0/+197 | 
| | | |||||
| * | fix worker code | Bryan Newbold | 2018-11-14 | 2 | -2/+5 | 
| | | |||||
| * | most_recent_message as reusable function | Bryan Newbold | 2018-11-14 | 2 | -26/+26 | 
| | | |||||
| * | update crossref controlled vocab | Bryan Newbold | 2018-11-14 | 2 | -3/+32 | 
| | | |||||
| * | python tweaks for date/datetime rust fix | Bryan Newbold | 2018-11-14 | 2 | -10/+3 | 
| | | |||||
| * | switch to auto consumer offset updates | Bryan Newbold | 2018-11-13 | 2 | -2/+11 | 
| | | | | | | | This is the classic/correct way to do consumer group updates for higher throughput, when "at least once" semantics are acceptible (as they are here; double processing should be safe/fine). | ||||
| * | to_elastic_dict -> release_elastic_dict | Bryan Newbold | 2018-11-13 | 1 | -1/+2 | 
| | | |||||
| * | use Counter object instead of per-metric ints | Bryan Newbold | 2018-11-13 | 6 | -17/+17 | 
| | | |||||
| * | more simple fatcat_client imports | Bryan Newbold | 2018-11-13 | 2 | -3/+2 | 
| | | |||||
| * | shuffle around fatcat_tools layout | Bryan Newbold | 2018-11-13 | 10 | -73/+7 | 
| | | |||||
| * | more python module refactoring | Bryan Newbold | 2018-11-12 | 8 | -8/+8 | 
| | | |||||
| * | refactor python modules | Bryan Newbold | 2018-11-12 | 12 | -0/+1243 | 
