Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | python impl of API ident harmonization | Bryan Newbold | 2018-12-24 | 6 | -36/+36 |
| | |||||
* | implement release_year (and rustfmt) | Bryan Newbold | 2018-12-24 | 3 | -9/+18 |
| | |||||
* | do actually require title for crossref import | Bryan Newbold | 2018-12-01 | 1 | -3/+3 |
| | |||||
* | fix file extraction (and transforms) | Bryan Newbold | 2018-11-26 | 1 | -6/+6 |
| | |||||
* | clean up harvester comments/docs | Bryan Newbold | 2018-11-21 | 3 | -50/+31 |
| | |||||
* | crossref importer doesn't require author/title attributes | Bryan Newbold | 2018-11-21 | 1 | -6/+6 |
| | |||||
* | crossref importer checks for existing DOIs | Bryan Newbold | 2018-11-21 | 2 | -4/+19 |
| | |||||
* | use isoformat() to format dates | Bryan Newbold | 2018-11-21 | 3 | -5/+6 |
| | | | | This shouldn't change behavior; it's just more consistent. | ||||
* | grobid importer: release_date as a date | Bryan Newbold | 2018-11-21 | 1 | -1/+1 |
| | |||||
* | fix loop_sleep typo | Bryan Newbold | 2018-11-21 | 2 | -2/+2 |
| | |||||
* | fix datacite DOI extraction | Bryan Newbold | 2018-11-21 | 1 | -1/+1 |
| | |||||
* | fix OAI-PMH name/finished message | Bryan Newbold | 2018-11-21 | 1 | -1/+6 |
| | |||||
* | fix oai-pmh issue again | Bryan Newbold | 2018-11-21 | 1 | -13/+14 |
| | |||||
* | oaipmh: handle NoRecordsMatch | Bryan Newbold | 2018-11-21 | 1 | -5/+8 |
| | |||||
* | start supporting kafka importers | Bryan Newbold | 2018-11-19 | 2 | -1/+18 |
| | | | | A nice feature would be some/any log output as to progress. | ||||
* | fix some broken importer args | Bryan Newbold | 2018-11-19 | 1 | -5/+7 |
| | |||||
* | monograph isn't a CSL type | Bryan Newbold | 2018-11-19 | 1 | -1/+1 |
| | |||||
* | not as strong a todo (timestamps) | Bryan Newbold | 2018-11-19 | 1 | -1/+1 |
| | |||||
* | initial OAI-PMH harvesters | Bryan Newbold | 2018-11-19 | 3 | -5/+167 |
| | |||||
* | better DOI registrar harvesters | Bryan Newbold | 2018-11-19 | 3 | -48/+145 |
| | |||||
* | bunch of pylint cleanup | Bryan Newbold | 2018-11-15 | 6 | -24/+38 |
| | |||||
* | large refactor of python names/paths | Bryan Newbold | 2018-11-15 | 13 | -30/+78 |
| | | | | | | | Add __init__.py files for fatcat_tools submodules, and use them in imports. Add a bunch of comments to files. Rename a number of classes and functions to be less verbose. | ||||
* | have recent message helper cleanup consumer | Bryan Newbold | 2018-11-15 | 1 | -1/+5 |
| | |||||
* | refactoring harvesters | Bryan Newbold | 2018-11-15 | 5 | -196/+210 |
| | |||||
* | initial work on metadata harvest bots | Bryan Newbold | 2018-11-14 | 4 | -0/+197 |
| | |||||
* | fix worker code | Bryan Newbold | 2018-11-14 | 2 | -2/+5 |
| | |||||
* | most_recent_message as reusable function | Bryan Newbold | 2018-11-14 | 2 | -26/+26 |
| | |||||
* | update crossref controlled vocab | Bryan Newbold | 2018-11-14 | 2 | -3/+32 |
| | |||||
* | python tweaks for date/datetime rust fix | Bryan Newbold | 2018-11-14 | 2 | -10/+3 |
| | |||||
* | switch to auto consumer offset updates | Bryan Newbold | 2018-11-13 | 2 | -2/+11 |
| | | | | | | This is the classic/correct way to do consumer group updates for higher throughput, when "at least once" semantics are acceptable (as they are here; double processing should be safe/fine). | ||||
* | to_elastic_dict -> release_elastic_dict | Bryan Newbold | 2018-11-13 | 1 | -1/+2 |
| | |||||
* | use Counter object instead of per-metric ints | Bryan Newbold | 2018-11-13 | 6 | -17/+17 |
| | |||||
* | more simple fatcat_client imports | Bryan Newbold | 2018-11-13 | 2 | -3/+2 |
| | |||||
* | shuffle around fatcat_tools layout | Bryan Newbold | 2018-11-13 | 10 | -73/+7 |
| | |||||
* | more python module refactoring | Bryan Newbold | 2018-11-12 | 8 | -8/+8 |
| | |||||
* | refactor python modules | Bryan Newbold | 2018-11-12 | 12 | -0/+1243 |