aboutsummaryrefslogtreecommitdiffstats
path: root/python/fatcat_import.py
Commit message (Expand)AuthorAgeFilesLines
* have ingest-file-results importer operate as crawl-botBryan Newbold2019-11-151-1/+1
* better ingest-file-results import nameBryan Newbold2019-11-151-1/+1
* ingest file result importerBryan Newbold2019-11-151-0/+34
* small fixes to confluent-kafka importers/workersBryan Newbold2019-09-201-1/+1
* convert importers to confluent-kafka libraryBryan Newbold2019-09-201-2/+3
* start chocula importerBryan Newbold2019-09-031-0/+14
* support extids in matched importerBryan Newbold2019-06-201-0/+4
* faster LargeFile XML importer for PubMedBryan Newbold2019-05-291-1/+1
* make pubmed ref lookups configurableBryan Newbold2019-05-221-1/+8
* creative importer for bulk JSTOR importsBryan Newbold2019-05-221-0/+18
* pubmed importer command and tweaksBryan Newbold2019-05-221-0/+25
* arxiv importer robustification and CLI implBryan Newbold2019-05-211-0/+21
* JALC bulk file importerBryan Newbold2019-05-211-0/+21
* fix default mimetype (impacted pre-1923 files)Bryan Newbold2019-05-151-1/+5
* editgroup description overrideBryan Newbold2019-04-221-1/+11
* minor arabesque tweaksBryan Newbold2019-04-181-12/+22
* arabesque importer using crawl-bot credsBryan Newbold2019-04-181-1/+1
* arabesque import tweaksBryan Newbold2019-04-181-0/+4
* early version of arabesque importerBryan Newbold2019-04-121-0/+28
* importer for CDL/DASH dat pilot dweb datasetsBryan Newbold2019-03-191-1/+29
* new importer: wayback_staticBryan Newbold2019-03-191-0/+48
* reduce default import batch size to 50Bryan Newbold2019-01-291-1/+1
* batch size as a general import paramBryan Newbold2019-01-281-13/+4
* add missing bezerk-mode flag to GROBID importBryan Newbold2019-01-281-3/+8
* fix typo in crossref importerBryan Newbold2019-01-281-1/+1
* update journal meta import/transformBryan Newbold2019-01-251-3/+3
* more import script fixesBryan Newbold2019-01-231-1/+4
* update importer scriptBryan Newbold2019-01-231-33/+24
* pubmed+datacite tokens; no journal,grobid,matched tokensBryan Newbold2019-01-221-2/+2
* issn => journal_metadata in several placesBryan Newbold2019-01-171-9/+9
* start refactoring API object passingBryan Newbold2019-01-081-13/+36
* crossref importer checks for existing DOIsBryan Newbold2018-11-211-3/+7
* correct kafka topic namesBryan Newbold2018-11-201-1/+1
* start supporting kafka importersBryan Newbold2018-11-191-3/+17
* bunch of pylint cleanupBryan Newbold2018-11-151-1/+1
* large refactor of python names/pathsBryan Newbold2018-11-151-39/+37
* shuffle around fatcat_tools layoutBryan Newbold2018-11-131-5/+5
* more python module refactoringBryan Newbold2018-11-121-5/+5
* remove more old python cruftBryan Newbold2018-11-121-8/+0
* fixes for grobid metadata importerBryan Newbold2018-09-281-0/+15
* fix issues with extid mapping in crossref-importerBryan Newbold2018-09-201-1/+1
* switch manifest importer to be json-basedBryan Newbold2018-09-141-16/+2
* add insert counting to importersBryan Newbold2018-09-141-0/+22
* extid support for crossref importerBryan Newbold2018-09-121-2/+5
* rename python scriptsBryan Newbold2018-07-261-0/+94