aboutsummaryrefslogtreecommitdiffstats
path: root/python
Commit message (Collapse)AuthorAgeFilesLines
* fix brittle/flapping python matched testBryan Newbold2018-12-141-4/+6
|
* add python-snappy (for pykafka)Bryan Newbold2018-12-102-31/+40
|
* remove textIcuSearch elastic filterBryan Newbold2018-12-031-1/+0
| | | | | | | This was breaking, eg, DOI search with non-trivial characters. Because we set the tokenizer on individual fields, shouldn't be necessary on the whole query.
* fix empty ISSN bug (container view)Bryan Newbold2018-12-031-2/+2
|
* do actually require title for crossref importBryan Newbold2018-12-011-3/+3
|
* codegen hide changesBryan Newbold2018-11-261-12/+60
|
* fix file extraction (and transforms)Bryan Newbold2018-11-263-6/+12
|
* pass-through more API errorsBryan Newbold2018-11-261-4/+13
|
* include ApiException in shellBryan Newbold2018-11-211-0/+1
|
* clean up harvester comments/docsBryan Newbold2018-11-213-50/+31
|
* enable sentry exceptions for fatcat-webBryan Newbold2018-11-211-4/+8
|
* enable sentry exceptions for fatcat-webBryan Newbold2018-11-214-2/+18
|
* remove Pipfile cruftBryan Newbold2018-11-212-82/+1
|
* ELASTICSEARCH not ELASTICBryan Newbold2018-11-212-3/+3
|
* crossref importer doesn't require author/title attributesBryan Newbold2018-11-211-6/+6
|
* crossref importer checks for existing DOIsBryan Newbold2018-11-215-9/+42
|
* use isoformat() to format datesBryan Newbold2018-11-213-5/+6
| | | | This shouldn't change behavior; it's just more consistent.
* grobid importer: release_date as a dateBryan Newbold2018-11-211-1/+1
|
* fix loop_sleep typoBryan Newbold2018-11-212-2/+2
|
* fix datacite DOI extractionBryan Newbold2018-11-211-1/+1
|
* fix OAI-PMH name/finished messageBryan Newbold2018-11-212-7/+12
|
* fix oai-pmh issue againBryan Newbold2018-11-211-13/+14
|
* oaipmh: handle NoRecordsMatchBryan Newbold2018-11-211-5/+8
|
* continous is a flag, not argBryan Newbold2018-11-211-1/+1
|
* correct kafka topic namesBryan Newbold2018-11-202-13/+13
|
* start supporting kafka importersBryan Newbold2018-11-194-5/+36
| | | | A nice feature would be some/any log output as to progress.
* fix some broken importer argsBryan Newbold2018-11-191-5/+7
|
* monograph isn't a CSL typeBryan Newbold2018-11-191-1/+1
|
* not as strong a todo (timestamps)Bryan Newbold2018-11-191-1/+1
|
* initial OAI-PMH harvestersBryan Newbold2018-11-196-133/+417
|
* better DOI registrar harvestersBryan Newbold2018-11-195-50/+190
|
* example shell interface for fatcat python libBryan Newbold2018-11-192-0/+46
|
* bunch of pylint cleanupBryan Newbold2018-11-159-29/+47
|
* at least some pylintBryan Newbold2018-11-151-1/+1
|
* large refactor of python names/pathsBryan Newbold2018-11-1532-107/+196
| | | | | | | - Add __init__.py files for fatcat_tools submodules, and use them in imports - Add a bunch of comments to files. - rename a number of classes and functions to be less verbose
* have recent message helper cleanup consumerBryan Newbold2018-11-151-1/+5
|
* refactoring harvestersBryan Newbold2018-11-158-232/+297
|
* initial work on metadata harvest botsBryan Newbold2018-11-145-6/+240
|
* fix worker codeBryan Newbold2018-11-142-2/+5
|
* most_recent_message as reusable functionBryan Newbold2018-11-142-26/+26
|
* update crossref controlled vocabBryan Newbold2018-11-143-4/+39
|
* python tweaks for date/datetime rust fixBryan Newbold2018-11-142-10/+3
|
* switch to auto consumer offset updatesBryan Newbold2018-11-132-2/+11
| | | | | | This is the classic/correct way to do consumer group updates for higher throughput, when "at least once" semantics are acceptible (as they are here; double processing should be safe/fine).
* webface: defer all javascript to end of bodyBryan Newbold2018-11-131-5/+5
|
* webface: add input form labelsBryan Newbold2018-11-133-6/+6
|
* update metadata download linksBryan Newbold2018-11-131-8/+9
|
* to_elastic_dict -> release_elastic_dictBryan Newbold2018-11-131-1/+2
|
* have webface default to local elasticsearchBryan Newbold2018-11-131-1/+2
|
* initial/example uwsgi fatcat_web fileBryan Newbold2018-11-131-0/+10
|
* load more config from envBryan Newbold2018-11-131-3/+3
|