path: root/python
Commit message | Author | Age | Files | Lines
* more redirect/state tests | Bryan Newbold | 2018-12-21 | 1 | -0/+15
* disallow self-redirects | Bryan Newbold | 2018-12-20 | 1 | -0/+24
* update redirect tests | Bryan Newbold | 2018-12-20 | 1 | -22/+66
* test codegen | Bryan Newbold | 2018-12-14 | 1 | -0/+120
    I didn't actually notice until recently that these are empty and thus useless, sigh.
* whole bunch of entity redirect tests (python) | Bryan Newbold | 2018-12-14 | 2 | -0/+342
    Some of the edge cases are marked skip while I re-think what behavior should actually be.
* multiple lookup extids (web view, no tests yet) | Bryan Newbold | 2018-12-14 | 1 | -12/+25
* python codegen | Bryan Newbold | 2018-12-14 | 6 | -123/+2162
* fix brittle/flapping python matched test | Bryan Newbold | 2018-12-14 | 1 | -4/+6
* add python-snappy (for pykafka) | Bryan Newbold | 2018-12-10 | 2 | -31/+40
* remove textIcuSearch elastic filter | Bryan Newbold | 2018-12-03 | 1 | -1/+0
    This was breaking, eg, DOI search with non-trivial characters. Because we set the tokenizer on individual fields, shouldn't be necessary on the whole query.
* fix empty ISSN bug (container view) | Bryan Newbold | 2018-12-03 | 1 | -2/+2
* do actually require title for crossref import | Bryan Newbold | 2018-12-01 | 1 | -3/+3
* codegen hide changes | Bryan Newbold | 2018-11-26 | 1 | -12/+60
* fix file extraction (and transforms) | Bryan Newbold | 2018-11-26 | 3 | -6/+12
* pass-through more API errors | Bryan Newbold | 2018-11-26 | 1 | -4/+13
* include ApiException in shell | Bryan Newbold | 2018-11-21 | 1 | -0/+1
* clean up harvester comments/docs | Bryan Newbold | 2018-11-21 | 3 | -50/+31
* enable sentry exceptions for fatcat-web | Bryan Newbold | 2018-11-21 | 1 | -4/+8
* enable sentry exceptions for fatcat-web | Bryan Newbold | 2018-11-21 | 4 | -2/+18
* remove Pipfile cruft | Bryan Newbold | 2018-11-21 | 2 | -82/+1
* ELASTICSEARCH not ELASTIC | Bryan Newbold | 2018-11-21 | 2 | -3/+3
* crossref importer doesn't require author/title attributes | Bryan Newbold | 2018-11-21 | 1 | -6/+6
* crossref importer checks for existing DOIs | Bryan Newbold | 2018-11-21 | 5 | -9/+42
* use isoformat() to format dates | Bryan Newbold | 2018-11-21 | 3 | -5/+6
    This shouldn't change behavior; it's just more consistent.
* grobid importer: release_date as a date | Bryan Newbold | 2018-11-21 | 1 | -1/+1
* fix loop_sleep typo | Bryan Newbold | 2018-11-21 | 2 | -2/+2
* fix datacite DOI extraction | Bryan Newbold | 2018-11-21 | 1 | -1/+1
* fix OAI-PMH name/finished message | Bryan Newbold | 2018-11-21 | 2 | -7/+12
* fix oai-pmh issue again | Bryan Newbold | 2018-11-21 | 1 | -13/+14
* oaipmh: handle NoRecordsMatch | Bryan Newbold | 2018-11-21 | 1 | -5/+8
* continous is a flag, not arg | Bryan Newbold | 2018-11-21 | 1 | -1/+1
* correct kafka topic names | Bryan Newbold | 2018-11-20 | 2 | -13/+13
* start supporting kafka importers | Bryan Newbold | 2018-11-19 | 4 | -5/+36
    A nice feature would be some/any log output as to progress.
* fix some broken importer args | Bryan Newbold | 2018-11-19 | 1 | -5/+7
* monograph isn't a CSL type | Bryan Newbold | 2018-11-19 | 1 | -1/+1
* not as strong a todo (timestamps) | Bryan Newbold | 2018-11-19 | 1 | -1/+1
* initial OAI-PMH harvesters | Bryan Newbold | 2018-11-19 | 6 | -133/+417
* better DOI registrar harvesters | Bryan Newbold | 2018-11-19 | 5 | -50/+190
* example shell interface for fatcat python lib | Bryan Newbold | 2018-11-19 | 2 | -0/+46
* bunch of pylint cleanup | Bryan Newbold | 2018-11-15 | 9 | -29/+47
* at least some pylint | Bryan Newbold | 2018-11-15 | 1 | -1/+1
* large refactor of python names/paths | Bryan Newbold | 2018-11-15 | 32 | -107/+196
    - Add __init__.py files for fatcat_tools submodules, and use them in imports
    - Add a bunch of comments to files.
    - rename a number of classes and functions to be less verbose
* have recent message helper cleanup consumer | Bryan Newbold | 2018-11-15 | 1 | -1/+5
* refactoring harvesters | Bryan Newbold | 2018-11-15 | 8 | -232/+297
* initial work on metadata harvest bots | Bryan Newbold | 2018-11-14 | 5 | -6/+240
* fix worker code | Bryan Newbold | 2018-11-14 | 2 | -2/+5
* most_recent_message as reusable function | Bryan Newbold | 2018-11-14 | 2 | -26/+26
* update crossref controlled vocab | Bryan Newbold | 2018-11-14 | 3 | -4/+39
* python tweaks for date/datetime rust fix | Bryan Newbold | 2018-11-14 | 2 | -10/+3
* switch to auto consumer offset updates | Bryan Newbold | 2018-11-13 | 2 | -2/+11
    This is the classic/correct way to do consumer group updates for higher throughput, when "at least once" semantics are acceptable (as they are here; double processing should be safe/fine).
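
The note on the last commit above (automatic offset updates with "at least once" semantics) can be illustrated with a toy simulation. This is only a sketch under stated assumptions: it does not use pykafka or any real Kafka client, and `run_consumer`, `commit_every`, `crash_after`, and the `release_N` identifiers are hypothetical names invented for the example. It shows why periodic offset commits can redeliver messages after a crash, and why that is harmless when processing is an idempotent upsert.

```python
from typing import Dict, List, Optional

# Records every offset handed to the worker, including redeliveries.
deliveries: List[int] = []


def run_consumer(
    log: List[dict],
    store: Dict[str, int],
    committed: int,
    commit_every: int,
    crash_after: Optional[int] = None,
) -> int:
    """Process messages starting at `committed`; return the committed offset.

    Offsets are committed only every `commit_every` messages, a stand-in
    for periodic auto-commit. A simulated crash loses uncommitted progress.
    """
    processed = 0
    for offset in range(committed, len(log)):
        if crash_after is not None and processed == crash_after:
            return committed  # crash: work since the last commit is replayed
        msg = log[offset]
        deliveries.append(offset)
        store[msg["ident"]] = msg["value"]  # idempotent upsert keyed by ident
        processed += 1
        if processed % commit_every == 0:
            committed = offset + 1
    return len(log)  # clean shutdown commits the final position


log = [{"ident": f"release_{i}", "value": i} for i in range(10)]
store: Dict[str, int] = {}

# First run crashes after six messages; only the first four were committed.
resume_offset = run_consumer(log, store, committed=0, commit_every=4, crash_after=6)

# Second run resumes from the committed offset, so offsets 4 and 5 are redelivered.
final_offset = run_consumer(log, store, committed=resume_offset, commit_every=4)
```

In this simulation two messages are processed twice, but the resulting `store` is the same as if each had been processed once, which is the property the commit message relies on when it says double processing should be safe.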