| | Commit message | Author | Age | Files | Lines |
---|---|---|---|---|---|
* | add python-snappy (for pykafka) | Bryan Newbold | 2018-12-10 | 2 | -31/+40 |
* | remove textIcuSearch elastic filter | Bryan Newbold | 2018-12-03 | 1 | -1/+0 |
| | This was breaking, e.g., DOI search with non-trivial characters. Because we set the tokenizer on individual fields, it shouldn't be necessary on the whole query. | | | | |
* | fix empty ISSN bug (container view) | Bryan Newbold | 2018-12-03 | 1 | -2/+2 |
* | do actually require title for crossref import | Bryan Newbold | 2018-12-01 | 1 | -3/+3 |
* | codegen hide changes | Bryan Newbold | 2018-11-26 | 1 | -12/+60 |
* | fix file extraction (and transforms) | Bryan Newbold | 2018-11-26 | 3 | -6/+12 |
* | pass-through more API errors | Bryan Newbold | 2018-11-26 | 1 | -4/+13 |
* | include ApiException in shell | Bryan Newbold | 2018-11-21 | 1 | -0/+1 |
* | clean up harvester comments/docs | Bryan Newbold | 2018-11-21 | 3 | -50/+31 |
* | enable sentry exceptions for fatcat-web | Bryan Newbold | 2018-11-21 | 1 | -4/+8 |
* | enable sentry exceptions for fatcat-web | Bryan Newbold | 2018-11-21 | 4 | -2/+18 |
* | remove Pipfile cruft | Bryan Newbold | 2018-11-21 | 2 | -82/+1 |
* | ELASTICSEARCH not ELASTIC | Bryan Newbold | 2018-11-21 | 2 | -3/+3 |
* | crossref importer doesn't require author/title attributes | Bryan Newbold | 2018-11-21 | 1 | -6/+6 |
* | crossref importer checks for existing DOIs | Bryan Newbold | 2018-11-21 | 5 | -9/+42 |
* | use isoformat() to format dates | Bryan Newbold | 2018-11-21 | 3 | -5/+6 |
| | This shouldn't change behavior; it's just more consistent. | | | | |
* | grobid importer: release_date as a date | Bryan Newbold | 2018-11-21 | 1 | -1/+1 |
* | fix loop_sleep typo | Bryan Newbold | 2018-11-21 | 2 | -2/+2 |
* | fix datacite DOI extraction | Bryan Newbold | 2018-11-21 | 1 | -1/+1 |
* | fix OAI-PMH name/finished message | Bryan Newbold | 2018-11-21 | 2 | -7/+12 |
* | fix oai-pmh issue again | Bryan Newbold | 2018-11-21 | 1 | -13/+14 |
* | oaipmh: handle NoRecordsMatch | Bryan Newbold | 2018-11-21 | 1 | -5/+8 |
* | continuous is a flag, not arg | Bryan Newbold | 2018-11-21 | 1 | -1/+1 |
* | correct kafka topic names | Bryan Newbold | 2018-11-20 | 2 | -13/+13 |
* | start supporting kafka importers | Bryan Newbold | 2018-11-19 | 4 | -5/+36 |
| | A nice feature would be some/any log output as to progress. | | | | |
* | fix some broken importer args | Bryan Newbold | 2018-11-19 | 1 | -5/+7 |
* | monograph isn't a CSL type | Bryan Newbold | 2018-11-19 | 1 | -1/+1 |
* | not as strong a todo (timestamps) | Bryan Newbold | 2018-11-19 | 1 | -1/+1 |
* | initial OAI-PMH harvesters | Bryan Newbold | 2018-11-19 | 6 | -133/+417 |
* | better DOI registrar harvesters | Bryan Newbold | 2018-11-19 | 5 | -50/+190 |
* | example shell interface for fatcat python lib | Bryan Newbold | 2018-11-19 | 2 | -0/+46 |
* | bunch of pylint cleanup | Bryan Newbold | 2018-11-15 | 9 | -29/+47 |
* | at least some pylint | Bryan Newbold | 2018-11-15 | 1 | -1/+1 |
* | large refactor of python names/paths | Bryan Newbold | 2018-11-15 | 32 | -107/+196 |
| | - Add __init__.py files for fatcat_tools submodules, and use them in imports | | | | |
| | - Add a bunch of comments to files | | | | |
| | - Rename a number of classes and functions to be less verbose | | | | |
* | have recent message helper cleanup consumer | Bryan Newbold | 2018-11-15 | 1 | -1/+5 |
* | refactoring harvesters | Bryan Newbold | 2018-11-15 | 8 | -232/+297 |
* | initial work on metadata harvest bots | Bryan Newbold | 2018-11-14 | 5 | -6/+240 |
* | fix worker code | Bryan Newbold | 2018-11-14 | 2 | -2/+5 |
* | most_recent_message as reusable function | Bryan Newbold | 2018-11-14 | 2 | -26/+26 |
* | update crossref controlled vocab | Bryan Newbold | 2018-11-14 | 3 | -4/+39 |
* | python tweaks for date/datetime rust fix | Bryan Newbold | 2018-11-14 | 2 | -10/+3 |
* | switch to auto consumer offset updates | Bryan Newbold | 2018-11-13 | 2 | -2/+11 |
| | This is the classic/correct way to do consumer group updates for higher throughput, when "at least once" semantics are acceptable (as they are here; double processing should be safe/fine). | | | | |
* | webface: defer all javascript to end of body | Bryan Newbold | 2018-11-13 | 1 | -5/+5 |
* | webface: add input form labels | Bryan Newbold | 2018-11-13 | 3 | -6/+6 |
* | update metadata download links | Bryan Newbold | 2018-11-13 | 1 | -8/+9 |
* | to_elastic_dict -> release_elastic_dict | Bryan Newbold | 2018-11-13 | 1 | -1/+2 |
* | have webface default to local elasticsearch | Bryan Newbold | 2018-11-13 | 1 | -1/+2 |
* | initial/example uwsgi fatcat_web file | Bryan Newbold | 2018-11-13 | 1 | -0/+10 |
* | load more config from env | Bryan Newbold | 2018-11-13 | 1 | -3/+3 |
* | elasticsearch config for worker | Bryan Newbold | 2018-11-13 | 1 | -1/+8 |
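
The note on the 2018-11-13 commit "switch to auto consumer offset updates" rests on "at least once" delivery being safe. As a rough illustration only, the toy simulation below shows why: offsets are committed periodically rather than per message, so a crash replays whatever was handled since the last commit, and an idempotent handler makes that replay harmless. This is not the fatcat worker code and involves no Kafka client; `consume`, `commit_every`, `crash_after`, and the sample messages are all hypothetical names chosen for the sketch.

```python
def consume(messages, store, log, committed=0, commit_every=3, crash_after=None):
    """Handle messages starting from the last committed offset.

    Offsets are committed only every `commit_every` handled messages, as a
    stand-in for periodic auto-commit. Returns the committed offset.
    """
    processed = 0
    for offset in range(committed, len(messages)):
        if crash_after is not None and processed == crash_after:
            # Simulated crash: progress since the last commit is forgotten.
            return committed
        key, value = messages[offset]
        store[key] = value   # idempotent upsert: a replay leaves the same state
        log.append(offset)   # record every delivery, including replays
        processed += 1
        if processed % commit_every == 0:
            committed = offset + 1
    # Clean shutdown: commit the final position.
    return len(messages)


# Hypothetical sample data: (key, value) pairs in log order.
messages = [("a", 1), ("b", 2), ("c", 3), ("d", 4), ("e", 5)]
store, log = {}, []

# First run crashes after handling four messages; only three were committed.
offset = consume(messages, store, log, crash_after=4)
# Second run resumes from the committed offset and replays the fourth message.
offset = consume(messages, store, log, committed=offset)
```

In this run the message at offset 3 is delivered twice, yet `store` ends up identical to a run with no crash, which is the property the commit note relies on when it says double processing should be safe.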