summaryrefslogtreecommitdiffstats
path: root/python/tests/files
Commit message (Expand)AuthorAgeFilesLines
* Merge branch 'martin-kafka-bs4-import' into 'master'Martin Czygan2020-03-102-0/+0
|\
| * more pubmed adjustmentsMartin Czygan2020-02-222-0/+0
* | Merge branch 'bnewbold-elastic-v03b'Bryan Newbold2020-02-263-0/+3
|\ \
| * | fix some transform bugs, add some testsBryan Newbold2020-01-293-0/+3
* | | shadow import: more filtering of file_meta fieldsBryan Newbold2020-02-131-12/+10
* | | basic shadow importerBryan Newbold2020-02-131-0/+12
| |/ |/|
* | datacite: add exception for https://www.micropublication.org/Martin Czygan2020-01-311-1/+2
* | datacite: improve date handling and minor tweakMartin Czygan2020-01-302-0/+110
|/
* do not normalize "en dash" in DOIMartin Czygan2020-01-171-1/+1
* ingest: improve tests, support old ingest resultsBryan Newbold2020-01-152-1/+2
* datacite: ignore known unknown values in resourceType*Martin Czygan2020-01-092-0/+94
* datacite: abstracts may be strings or list of stringsMartin Czygan2020-01-094-0/+186
* datacite: improve license_slug handlingMartin Czygan2020-01-092-1/+3
* datacite: add 'Unknown' to blacklistMartin Czygan2020-01-091-7/+1
* datacite: get rid of schemaVersionMartin Czygan2020-01-0917-32/+14
* datacite: reformat test cases and use jq . --sort-keysMartin Czygan2020-01-0854-2299/+2301
* datacite: factor out contributor handlingMartin Czygan2020-01-084-0/+105
* datacite: adjust tests for release_monthMartin Czygan2020-01-0812-12/+12
* datacite: mark additional files as stubMartin Czygan2020-01-082-0/+72
* datacite: CCDC are entries, mostlyMartin Czygan2020-01-081-1/+1
* datacite: adding datacite-specific extra metadataMartin Czygan2020-01-0730-1468/+1570
* datacite: month field should be top-levelMartin Czygan2020-01-0611-14/+14
* datacite: include month in extraMartin Czygan2020-01-0611-11/+13
* datacite: clean abstracts, use unknown value tokensMartin Czygan2020-01-063-3/+3
* datacite: always include "datacite" key in extraMartin Czygan2020-01-0414-26/+26
* datacite: remove --lang-detect flagMartin Czygan2020-01-035-10/+15
* datacite: add another test caseMartin Czygan2020-01-022-0/+70
* datacite: open case for editing after creationMartin Czygan2020-01-021-0/+2
* datacite: add helper script to create new test caseMartin Czygan2020-01-021-0/+14
* datacite: address raw_name index form commentMartin Czygan2020-01-0219-111/+111
* datacite: add conversion fixturesMartin Czygan2020-01-0249-0/+3924
* improve datacite field mapping and importMartin Czygan2019-12-282-0/+1
* datacite: add simple test and fixture for datacite api interactionMartin Czygan2019-12-271-0/+1
* add regression test for medlinedate -> year parsingBryan Newbold2019-12-231-0/+95
* add basic test for crossref harvest API callBryan Newbold2019-12-061-0/+1
* ingest file result importerBryan Newbold2019-11-151-0/+1
* release elasticsearch results: stage not statusBryan Newbold2019-06-131-1/+1
* JALC bulk file importerBryan Newbold2019-05-211-0/+100
* basic JALC XML DOI metadata parserBryan Newbold2019-05-211-0/+176
* basic JSTOR XML parserBryan Newbold2019-05-211-0/+58
* basic arxivraw XML parserBryan Newbold2019-05-211-0/+31
* basic pubmed parserBryan Newbold2019-05-211-0/+36822
* fix releases/release_ids in math_universe.json test fileBryan Newbold2019-05-201-1/+1
* importer code updatesBryan Newbold2019-05-131-1/+1
* update example release JSON to new schema (ext_id, release_stage)Bryan Newbold2019-05-132-11/+11
* arabesque import testsBryan Newbold2019-04-182-0/+10
* many web test improvementsBryan Newbold2019-04-042-0/+2
* more integration of transform refactorBryan Newbold2019-03-111-0/+10
* crossref import tweaks/fixesBryan Newbold2019-01-291-0/+1
* fix matched test vectorBryan Newbold2019-01-281-1/+1