path: root/python/fatcat_tools/importers/__init__.py
Commit message [Author, Date, Files, Lines]
* refactor importer metadata tables into separate file; move some helpers around [Bryan Newbold, 2021-11-10, 1 file, -2/+1]
* importers: refactor imports of clean() and other normalization helpers [Bryan Newbold, 2021-11-10, 1 file, -3/+0]
* remove cdl_dash_dat and wayback_static importers [Bryan Newbold, 2021-11-10, 1 file, -2/+0]
* re-fmt all the fatcat_tools __init__ files for readability [Bryan Newbold, 2021-11-02, 1 file, -17/+39]
* initial implementation of fileset ingest importers [Bryan Newbold, 2021-10-14, 1 file, -1/+1]
* generic fileset importer class, with test coverage [Bryan Newbold, 2021-10-14, 1 file, -0/+1]
* new SPN web (html) importer [Bryan Newbold, 2021-10-01, 1 file, -1/+1]
* very simple dblp container importer [Bryan Newbold, 2020-12-17, 1 file, -0/+1]
* initial implementation of dblp release importer (in progress) [Bryan Newbold, 2020-12-17, 1 file, -0/+1]
* initial implementation of DOAJ importer [Bryan Newbold, 2020-11-19, 1 file, -0/+1]
* ingest: initial 'web' worker implementation [Bryan Newbold, 2020-11-05, 1 file, -1/+1]
* initial implementation of file_meta importer [Bryan Newbold, 2020-08-21, 1 file, -0/+1]
* Merge branch 'martin-kafka-bs4-import' into 'master' [Martin Czygan, 2020-03-10, 1 file, -1/+1]
|\
| * pubmed ftp harvest and KafkaBs4XmlPusher [Martin Czygan, 2020-02-19, 1 file, -1/+1]
* | basic shadow importer [Bryan Newbold, 2020-02-13, 1 file, -0/+1]
|/
* datacite: importer skeleton [Martin Czygan, 2019-12-28, 1 file, -0/+1]
* savepapernow result importer [Bryan Newbold, 2019-12-12, 1 file, -1/+1]
* ingest file result importer [Bryan Newbold, 2019-11-15, 1 file, -2/+1]
* implement ChoculaImporter [Bryan Newbold, 2019-09-03, 1 file, -0/+1]
* faster LargeFile XML importer for PubMed [Bryan Newbold, 2019-05-29, 1 file, -1/+1]
* creative importer for bulk JSTOR imports [Bryan Newbold, 2019-05-22, 1 file, -1/+1]
* JALC bulk file importer [Bryan Newbold, 2019-05-21, 1 file, -1/+1]
* initial pubmed importer [Bryan Newbold, 2019-05-21, 1 file, -2/+3]
* initial arxivraw importer (from parser) [Bryan Newbold, 2019-05-21, 1 file, -0/+1]
* initial JSTOR importer [Bryan Newbold, 2019-05-21, 1 file, -0/+1]
* initial flesh out of JALC parser [Bryan Newbold, 2019-05-21, 1 file, -1/+2]
* early version of arabesque importer [Bryan Newbold, 2019-04-12, 1 file, -0/+1]
* add SqlitePusher importer option [Bryan Newbold, 2019-04-12, 1 file, -1/+1]
* importer for CDL/DASH dat pilot dweb datasets [Bryan Newbold, 2019-03-19, 1 file, -0/+1]
* new importer: wayback_static [Bryan Newbold, 2019-03-19, 1 file, -0/+1]
* ftfy all over (needs Pipfile.lock) [Bryan Newbold, 2019-01-23, 1 file, -1/+1]
* refactor remaining importers [Bryan Newbold, 2019-01-22, 1 file, -1/+1]
* refactored crossref importer to new style [Bryan Newbold, 2019-01-22, 1 file, -3/+3]
* new importer API interfaces [Bryan Newbold, 2019-01-22, 1 file, -0/+15]
* issn => journal_metadata in several places [Bryan Newbold, 2019-01-17, 1 file, -1/+1]
* start supporting kafka importers [Bryan Newbold, 2018-11-19, 1 file, -1/+1]
* large refactor of python names/paths [Bryan Newbold, 2018-11-15, 1 file, -0/+7]