summaryrefslogtreecommitdiffstats
path: root/python/tests/files
Commit message (Collapse)AuthorAgeFilesLines
* release elasticsearch results: stage not statusBryan Newbold2019-06-131-1/+1
|
* JALC bulk file importerBryan Newbold2019-05-211-0/+100
|
* basic JALC XML DOI metadata parserBryan Newbold2019-05-211-0/+176
|
* basic JSTOR XML parserBryan Newbold2019-05-211-0/+58
|
* basic arxivraw XML parserBryan Newbold2019-05-211-0/+31
|
* basic pubmed parserBryan Newbold2019-05-211-0/+36822
|
* fix releases/release_ids in math_universe.json test fileBryan Newbold2019-05-201-1/+1
|
* importer code updatesBryan Newbold2019-05-131-1/+1
|
* update example release JSON to new schema (ext_id, release_stage)Bryan Newbold2019-05-132-11/+11
|
* arabesque import testsBryan Newbold2019-04-182-0/+10
|
* many web test improvementsBryan Newbold2019-04-042-0/+2
|
* more integration of transform refactorBryan Newbold2019-03-111-0/+10
|
* crossref import tweaks/fixesBryan Newbold2019-01-291-0/+1
| | | | | - refs: article-title not title; save unstructured; authors not author - save 'language' field (already an ISO code)
* fix matched test vectorBryan Newbold2019-01-281-1/+1
| | | | this was resulting in a collision with default/example database objects.
* update journal meta import/transformBryan Newbold2019-01-252-10/+20
|
* tweak crossref import, and update testsBryan Newbold2019-01-241-4/+20
|
* allow importing contrib/refs listsBryan Newbold2019-01-241-0/+0
| | | | | | The motivation here isn't really to support these gigantic lists on principle, but to be able to ingest large corpuses without having to decide whether to filter out or crop such lists.
* crossref importer updatesBryan Newbold2019-01-221-1/+1
|
* fix file extraction (and transforms)Bryan Newbold2018-11-261-0/+1
|
* improvements to grobid_metadata importerBryan Newbold2018-09-271-0/+10
| | | | | But still fails tests due to database collision/side-effect on sha1 lookup.
* more python example filesBryan Newbold2018-09-222-0/+424
|
* more matched testsBryan Newbold2018-09-141-0/+10
|
* switch manifest importer to be json-basedBryan Newbold2018-09-141-3/+3
|
* fixes to matched importer (and a test)Bryan Newbold2018-09-141-0/+3
|
* extid support for crossref importerBryan Newbold2018-09-121-0/+0
|
* fix python import of ORCIDs ending 'X'Bryan Newbold2018-09-101-0/+1
|
* improve handling of invalid identifiersBryan Newbold2018-08-152-0/+2
|
* ISSN importerBryan Newbold2018-06-211-0/+10
|
* importer tests and fixesBryan Newbold2018-06-202-10/+10
|
* more progress on crossref+orcid importersBryan Newbold2018-06-202-0/+13
|
* basic ORCID importerBryan Newbold2018-06-091-0/+1
|
* move python code to subdirectoryBryan Newbold2018-05-161-0/+10