aboutsummaryrefslogtreecommitdiffstats
path: root/python/fatcat_tools
Commit message (Collapse)AuthorAgeFilesLines
...
* new importer API interfacesBryan Newbold2019-01-222-0/+181
|
* crossref importer updatesBryan Newbold2019-01-221-19/+78
|
* remove coden and abbrev from python toolsBryan Newbold2019-01-211-2/+0
|
* include filesets and webcaptures in exportsBryan Newbold2019-01-181-1/+1
|
* fix typo in elastic transform codeBryan Newbold2019-01-181-1/+1
|
* more 'true' -> True query param fixesBryan Newbold2019-01-184-4/+4
|
* state in elasticsearch (and deleted/redirects)Bryan Newbold2019-01-181-2/+8
|
* issn => journal_metadata in several placesBryan Newbold2019-01-172-6/+6
|
* use full-on autoaccept modeBryan Newbold2019-01-115-12/+16
| | | | | | | | Now that editor_id is infered from token, don't *need* to create ahead of time. This backend change simplifies things greatly (either update an existing editgroup, or create new and *only* include entities in the batch transaction), at the cost of being able to configure the editgroup in any way, including setting a description.
* Merge branch 'bnewbold-crude-auth'Bryan Newbold2019-01-0811-36/+131
|\ | | | | | | | | Fixed a conflict in: python/fatcat_export.py
| * workers do API-passing (not URI-passing)Bryan Newbold2019-01-082-9/+7
| |
| * importers and tests all use new api-passingBryan Newbold2019-01-086-10/+44
| |
| * start updating importer auth with crossref importerBryan Newbold2019-01-083-14/+40
| |
| * entity_to_json -> entity_to_dictBryan Newbold2019-01-082-2/+2
| |
| * start refactoring API object passingBryan Newbold2019-01-082-0/+41
| |
| * don't need to supply editor_id nowBryan Newbold2018-12-312-8/+4
| |
* | check request status codes idiomaticallyBryan Newbold2018-12-292-3/+3
|/
* python impl of API ident harmonizationBryan Newbold2018-12-246-36/+36
|
* implement release_year (and rustfmt)Bryan Newbold2018-12-243-9/+18
|
* do actually require title for crossref importBryan Newbold2018-12-011-3/+3
|
* fix file extraction (and transforms)Bryan Newbold2018-11-261-6/+6
|
* clean up harvester comments/docsBryan Newbold2018-11-213-50/+31
|
* crossref importer doesn't require author/title attributesBryan Newbold2018-11-211-6/+6
|
* crossref importer checks for existing DOIsBryan Newbold2018-11-212-4/+19
|
* use isoformat() to format datesBryan Newbold2018-11-213-5/+6
| | | | This shouldn't change behavior; it's just more consistent.
* grobid importer: release_date as a dateBryan Newbold2018-11-211-1/+1
|
* fix loop_sleep typoBryan Newbold2018-11-212-2/+2
|
* fix datacite DOI extractionBryan Newbold2018-11-211-1/+1
|
* fix OAI-PMH name/finished messageBryan Newbold2018-11-211-1/+6
|
* fix oai-pmh issue againBryan Newbold2018-11-211-13/+14
|
* oaipmh: handle NoRecordsMatchBryan Newbold2018-11-211-5/+8
|
* start supporting kafka importersBryan Newbold2018-11-192-1/+18
| | | | A nice feature would be some/any log output as to progress.
* fix some broken importer argsBryan Newbold2018-11-191-5/+7
|
* monograph isn't a CSL typeBryan Newbold2018-11-191-1/+1
|
* not as strong a todo (timestamps)Bryan Newbold2018-11-191-1/+1
|
* initial OAI-PMH harvestersBryan Newbold2018-11-193-5/+167
|
* better DOI registrar harvestersBryan Newbold2018-11-193-48/+145
|
* bunch of pylint cleanupBryan Newbold2018-11-156-24/+38
|
* large refactor of python names/pathsBryan Newbold2018-11-1513-30/+78
| | | | | | | - Add __init__.py files for fatcat_tools submodules, and use them in imports - Add a bunch of comments to files. - rename a number of classes and functions to be less verbose
* have recent message helper cleanup consumerBryan Newbold2018-11-151-1/+5
|
* refactoring harvestersBryan Newbold2018-11-155-196/+210
|
* initial work on metadata harvest botsBryan Newbold2018-11-144-0/+197
|
* fix worker codeBryan Newbold2018-11-142-2/+5
|
* most_recent_message as reusable functionBryan Newbold2018-11-142-26/+26
|
* update crossref controlled vocabBryan Newbold2018-11-142-3/+32
|
* python tweaks for date/datetime rust fixBryan Newbold2018-11-142-10/+3
|
* switch to auto consumer offset updatesBryan Newbold2018-11-132-2/+11
| | | | | | This is the classic/correct way to do consumer group updates for higher throughput, when "at least once" semantics are acceptible (as they are here; double processing should be safe/fine).
* to_elastic_dict -> release_elastic_dictBryan Newbold2018-11-131-1/+2
|
* use Counter object instead of per-metric intsBryan Newbold2018-11-136-17/+17
|
* more simple fatcat_client importsBryan Newbold2018-11-132-3/+2
|