path: root/python/fatcat_tools
Commit message (Author, Age, Files, Lines)
* allow importing contrib/refs lists (Bryan Newbold, 2019-01-24, 1 file, -5/+13)
|     The motivation here isn't really to support these gigantic lists on principle, but to be able to ingest large corpuses without having to decide whether to filter out or crop such lists.
* notes on refactoring container 'extra' (Bryan Newbold, 2019-01-24, 1 file, -0/+79)
|
* importer bugfixes (Bryan Newbold, 2019-01-23, 3 files, -8/+14)
|
* start changes to release ES schema (Bryan Newbold, 2019-01-23, 1 file, -30/+90)
|
* bunch of crossref import tweaks (need tests) (Bryan Newbold, 2019-01-23, 1 file, -50/+43)
|
* try to fix any_abstract (Bryan Newbold, 2019-01-23, 1 file, -1/+1)
|
* clean() checks if it returns null-length string (Bryan Newbold, 2019-01-23, 1 file, -1/+5)
|
* matched importer: bezerk mode to skip file updates (Bryan Newbold, 2019-01-23, 1 file, -11/+5)
|
* ftfy all over (needs Pipfile.lock) (Bryan Newbold, 2019-01-23, 7 files, -39/+74)
|
* more tests; fix some importer behavior (Bryan Newbold, 2019-01-23, 2 files, -37/+43)
|
* improve changelog tests (Bryan Newbold, 2019-01-23, 1 file, -1/+0)
|
* refactor remaining importers (Bryan Newbold, 2019-01-22, 7 files, -328/+297)
|
* refactored crossref importer to new style (Bryan Newbold, 2019-01-22, 3 files, -89/+166)
|
* new importer API interfaces (Bryan Newbold, 2019-01-22, 2 files, -0/+181)
|
* crossref importer updates (Bryan Newbold, 2019-01-22, 1 file, -19/+78)
|
* remove coden and abbrev from python tools (Bryan Newbold, 2019-01-21, 1 file, -2/+0)
|
* include filesets and webcaptures in exports (Bryan Newbold, 2019-01-18, 1 file, -1/+1)
|
* fix typo in elastic transform code (Bryan Newbold, 2019-01-18, 1 file, -1/+1)
|
* more 'true' -> True query param fixes (Bryan Newbold, 2019-01-18, 4 files, -4/+4)
|
* state in elasticsearch (and deleted/redirects) (Bryan Newbold, 2019-01-18, 1 file, -2/+8)
|
* issn => journal_metadata in several places (Bryan Newbold, 2019-01-17, 2 files, -6/+6)
|
* use full-on autoaccept mode (Bryan Newbold, 2019-01-11, 5 files, -12/+16)
|     Now that editor_id is inferred from token, don't *need* to create ahead of time. This backend change simplifies things greatly (either update an existing editgroup, or create new and *only* include entities in the batch transaction), at the cost of no longer being able to configure the editgroup in any way, including setting a description.
* Merge branch 'bnewbold-crude-auth' (Bryan Newbold, 2019-01-08, 11 files, -36/+131)
|\
| |     Fixed a conflict in: python/fatcat_export.py
| * workers do API-passing (not URI-passing) (Bryan Newbold, 2019-01-08, 2 files, -9/+7)
| |
| * importers and tests all use new api-passing (Bryan Newbold, 2019-01-08, 6 files, -10/+44)
| |
| * start updating importer auth with crossref importer (Bryan Newbold, 2019-01-08, 3 files, -14/+40)
| |
| * entity_to_json -> entity_to_dict (Bryan Newbold, 2019-01-08, 2 files, -2/+2)
| |
| * start refactoring API object passing (Bryan Newbold, 2019-01-08, 2 files, -0/+41)
| |
| * don't need to supply editor_id now (Bryan Newbold, 2018-12-31, 2 files, -8/+4)
| |
* | check request status codes idiomatically (Bryan Newbold, 2018-12-29, 2 files, -3/+3)
|/
* python impl of API ident harmonization (Bryan Newbold, 2018-12-24, 6 files, -36/+36)
|
* implement release_year (and rustfmt) (Bryan Newbold, 2018-12-24, 3 files, -9/+18)
|
* do actually require title for crossref import (Bryan Newbold, 2018-12-01, 1 file, -3/+3)
|
* fix file extraction (and transforms) (Bryan Newbold, 2018-11-26, 1 file, -6/+6)
|
* clean up harvester comments/docs (Bryan Newbold, 2018-11-21, 3 files, -50/+31)
|
* crossref importer doesn't require author/title attributes (Bryan Newbold, 2018-11-21, 1 file, -6/+6)
|
* crossref importer checks for existing DOIs (Bryan Newbold, 2018-11-21, 2 files, -4/+19)
|
* use isoformat() to format dates (Bryan Newbold, 2018-11-21, 3 files, -5/+6)
|     This shouldn't change behavior; it's just more consistent.
* grobid importer: release_date as a date (Bryan Newbold, 2018-11-21, 1 file, -1/+1)
|
* fix loop_sleep typo (Bryan Newbold, 2018-11-21, 2 files, -2/+2)
|
* fix datacite DOI extraction (Bryan Newbold, 2018-11-21, 1 file, -1/+1)
|
* fix OAI-PMH name/finished message (Bryan Newbold, 2018-11-21, 1 file, -1/+6)
|
* fix oai-pmh issue again (Bryan Newbold, 2018-11-21, 1 file, -13/+14)
|
* oaipmh: handle NoRecordsMatch (Bryan Newbold, 2018-11-21, 1 file, -5/+8)
|
* start supporting kafka importers (Bryan Newbold, 2018-11-19, 2 files, -1/+18)
|     A nice feature would be some/any log output as to progress.
* fix some broken importer args (Bryan Newbold, 2018-11-19, 1 file, -5/+7)
|
* monograph isn't a CSL type (Bryan Newbold, 2018-11-19, 1 file, -1/+1)
|
* not as strong a todo (timestamps) (Bryan Newbold, 2018-11-19, 1 file, -1/+1)
|
* initial OAI-PMH harvesters (Bryan Newbold, 2018-11-19, 3 files, -5/+167)
|
* better DOI registrar harvesters (Bryan Newbold, 2018-11-19, 3 files, -48/+145)
|