Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | clean up harvester comments/docs | Bryan Newbold | 2018-11-21 | 3 | -50/+31 |
| | |||||
* | enable sentry exceptions for fatcat-web | Bryan Newbold | 2018-11-21 | 1 | -4/+8 |
| | |||||
* | enable sentry exceptions for fatcat-web | Bryan Newbold | 2018-11-21 | 4 | -2/+18 |
| | |||||
* | remove Pipfile cruft | Bryan Newbold | 2018-11-21 | 2 | -82/+1 |
| | |||||
* | ELASTICSEARCH not ELASTIC | Bryan Newbold | 2018-11-21 | 2 | -3/+3 |
| | |||||
* | oops, I meant pylint, not pytest | Bryan Newbold | 2018-11-21 | 1 | -1/+1 |
| | |||||
* | fix gitlab pytest -E | Bryan Newbold | 2018-11-21 | 1 | -1/+1 |
| | |||||
* | crossref importer doesn't require author/title attributes | Bryan Newbold | 2018-11-21 | 1 | -6/+6 |
| | |||||
* | crossref importer checks for existing DOIs | Bryan Newbold | 2018-11-21 | 5 | -9/+42 |
| | |||||
* | use isoformat() to format dates | Bryan Newbold | 2018-11-21 | 3 | -5/+6 |
| | This shouldn't change behavior; it's just more consistent. | ||||
* | grobid importer: release_date as a date | Bryan Newbold | 2018-11-21 | 1 | -1/+1 |
| | |||||
* | fix loop_sleep typo | Bryan Newbold | 2018-11-21 | 2 | -2/+2 |
| | |||||
* | fix datacite DOI extraction | Bryan Newbold | 2018-11-21 | 1 | -1/+1 |
| | |||||
* | fix OAI-PMH name/finished message | Bryan Newbold | 2018-11-21 | 2 | -7/+12 |
| | |||||
* | fix oai-pmh issue again | Bryan Newbold | 2018-11-21 | 1 | -13/+14 |
| | |||||
* | oaipmh: handle NoRecordsMatch | Bryan Newbold | 2018-11-21 | 1 | -5/+8 |
| | |||||
* | continuous is a flag, not arg | Bryan Newbold | 2018-11-21 | 1 | -1/+1 |
| | |||||
* | correct kafka topic names | Bryan Newbold | 2018-11-20 | 2 | -13/+13 |
| | |||||
* | start supporting kafka importers | Bryan Newbold | 2018-11-19 | 4 | -5/+36 |
| | A nice feature would be some/any log output as to progress. | ||||
* | fix some broken importer args | Bryan Newbold | 2018-11-19 | 1 | -5/+7 |
| | |||||
* | monograph isn't a CSL type | Bryan Newbold | 2018-11-19 | 2 | -2/+2 |
| | |||||
* | not as strong a todo (timestamps) | Bryan Newbold | 2018-11-19 | 1 | -1/+1 |
| | |||||
* | initial OAI-PMH harvesters | Bryan Newbold | 2018-11-19 | 6 | -133/+417 |
| | |||||
* | better DOI registrar harvesters | Bryan Newbold | 2018-11-19 | 5 | -50/+190 |
| | |||||
* | notes on DOI registrars (other than crossref) | Bryan Newbold | 2018-11-19 | 1 | -0/+39 |
| | |||||
* | example shell interface for fatcat python lib | Bryan Newbold | 2018-11-19 | 2 | -0/+46 |
| | |||||
* | bunch of pylint cleanup | Bryan Newbold | 2018-11-15 | 9 | -29/+47 |
| | |||||
* | at least some pylint | Bryan Newbold | 2018-11-15 | 2 | -3/+3 |
| | |||||
* | large refactor of python names/paths | Bryan Newbold | 2018-11-15 | 32 | -107/+196 |
| | - Add __init__.py files for fatcat_tools submodules, and use them in imports; - Add a bunch of comments to files; - rename a number of classes and functions to be less verbose | ||||
* | have recent message helper cleanup consumer | Bryan Newbold | 2018-11-15 | 1 | -1/+5 |
| | |||||
* | refactoring harvesters | Bryan Newbold | 2018-11-15 | 8 | -232/+297 |
| | |||||
* | initial work on metadata harvest bots | Bryan Newbold | 2018-11-14 | 5 | -6/+240 |
| | |||||
* | fix worker code | Bryan Newbold | 2018-11-14 | 2 | -2/+5 |
| | |||||
* | update TODOs | Bryan Newbold | 2018-11-14 | 2 | -0/+4 |
| | |||||
* | most_recent_message as reusable function | Bryan Newbold | 2018-11-14 | 2 | -26/+26 |
| | |||||
* | update crossref controlled vocab | Bryan Newbold | 2018-11-14 | 3 | -4/+39 |
| | |||||
* | add (disabled) test that invalid fields should error | Bryan Newbold | 2018-11-14 | 1 | -0/+16 |
| | |||||
* | implement new controlled vocabularies | Bryan Newbold | 2018-11-14 | 3 | -16/+49 |
| | |||||
* | enforce some controlled vocabularies in API | Bryan Newbold | 2018-11-14 | 4 | -0/+128 |
| | |||||
* | python tweaks for date/datetime rust fix | Bryan Newbold | 2018-11-14 | 2 | -10/+3 |
| | |||||
* | disable breaking CI test | Bryan Newbold | 2018-11-14 | 2 | -3/+6 |
| | See commit comment for details; a problem with gitlab CI and setup() function, not the test in particular. Grump. | ||||
* | fix date/datetime confusion on rust/API side | Bryan Newbold | 2018-11-14 | 4 | -6/+72 |
| | Should have dug into this earlier; python code was getting confused. This is a breaking API change, from a practical standpoint, as both python and rust code had been hacked to work around this. | ||||
* | rustfmt | Bryan Newbold | 2018-11-14 | 4 | -59/+105 |
| | |||||
* | more kafka performance notes | Bryan Newbold | 2018-11-14 | 1 | -1/+15 |
| | |||||
* | bunch of notes on CSL alignment and types | Bryan Newbold | 2018-11-14 | 4 | -43/+196 |
| | |||||
* | switch to auto consumer offset updates | Bryan Newbold | 2018-11-13 | 3 | -2/+28 |
| | This is the classic/correct way to do consumer group updates for higher throughput, when "at least once" semantics are acceptable (as they are here; double processing should be safe/fine). | ||||
* | webface: defer all javascript to end of body | Bryan Newbold | 2018-11-13 | 1 | -5/+5 |
| | |||||
* | webface: add input form labels | Bryan Newbold | 2018-11-13 | 3 | -6/+6 |
| | |||||
* | update metadata download links | Bryan Newbold | 2018-11-13 | 1 | -8/+9 |
| | |||||
* | to_elastic_dict -> release_elastic_dict | Bryan Newbold | 2018-11-13 | 1 | -1/+2 |
| |
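
The note on the "switch to auto consumer offset updates" commit says double processing is acceptable under "at least once" delivery. Below is a minimal sketch of why that can hold when the handler is idempotent. It assumes a toy in-memory log rather than a real Kafka client; `consume`, `commit_every`, `crash_after`, `delivered`, and the record keys are hypothetical names for illustration, not taken from the fatcat code.

```python
# Toy simulation only: no Kafka client is involved, and every name here
# is hypothetical. Offsets are committed periodically, not per message.

def consume(log, committed, store, delivered, commit_every=3, crash_after=None):
    """Process `log` starting at offset `committed`; return the committed offset.

    A crash between periodic commits loses the uncommitted progress, so the
    affected messages are delivered again on restart (at-least-once).
    """
    handled = 0
    for offset in range(committed, len(log)):
        if crash_after is not None and handled == crash_after:
            return committed  # simulated crash: uncommitted progress is lost
        key, value = log[offset]
        delivered.append(offset)  # record every delivery, including repeats
        store[key] = value  # idempotent upsert: replaying a message is harmless
        handled += 1
        if handled % commit_every == 0:
            committed = offset + 1  # periodic "auto" commit
    return len(log)  # clean shutdown commits the final position


log = [("a", 1), ("b", 2), ("c", 3), ("d", 4), ("e", 5)]
store, delivered = {}, []

# First run crashes after handling 4 messages; only 3 were committed.
offset = consume(log, 0, store, delivered, crash_after=4)
# Restart resumes from the committed offset, so offset 3 is delivered twice.
offset = consume(log, offset, store, delivered)
```

Because the handler is a keyed upsert, the repeated delivery leaves `store` in the same state as a single delivery would. A handler with non-idempotent side effects (for example, appending to a list or incrementing a counter) would not be safe under this scheme.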