summaryrefslogtreecommitdiffstats
path: root/python
Commit message (Expand)AuthorAgeFilesLines
* transform ingests via pmc/pmcid, not pubmed/pmidBryan Newbold2019-12-241-4/+4
* allow arabesque backfill ingests for some source typesBryan Newbold2019-12-241-0/+5
* make chocula URL updates more conservativeBryan Newbold2019-12-241-5/+5
* pubmed: if doing update, also do subtitle schema updateBryan Newbold2019-12-231-1/+9
* doi parsing fixesBryan Newbold2019-12-231-0/+7
* pubmed: improve warning and stderr formattingBryan Newbold2019-12-231-5/+6
* pubmed: use standard identifier cleanersBryan Newbold2019-12-231-17/+14
* pubmed: remove unused extid mapping codeBryan Newbold2019-12-231-29/+0
* pubmed: do reference lookups by defaultBryan Newbold2019-12-231-1/+1
* normalizers: clean_pmid(), and handle nulls in all other cleanersBryan Newbold2019-12-231-0/+31
* pubmed: null doi parsing checkBryan Newbold2019-12-231-1/+1
* add basic MedlineDate year parsingBryan Newbold2019-12-231-0/+11
* add regression test for medlinedate -> year parsingBryan Newbold2019-12-232-0/+102
* fix spn/ingest importer duplication checkBryan Newbold2019-12-221-6/+8
* datacite release links and metadata expansionBryan Newbold2019-12-202-9/+13
* spn: incluce link_source/link_source_id in ingest requestBryan Newbold2019-12-201-0/+2
* pipenv: update depsBryan Newbold2019-12-172-11/+55
* pipenv: restrict pytest<5.0.0Bryan Newbold2019-12-172-5/+13
* pipenv: update Pipfile and Pipfile.lockBryan Newbold2019-12-172-286/+318
* pipfile: add langcodes and dateparser dependenciesBryan Newbold2019-12-172-1/+44
* write diagnostic messages to stderrMartin Czygan2019-12-161-2/+2
* Merge branch 'martin-importers-common-doc-fix' into 'master'Martin Czygan2019-12-141-13/+10
|\
| * complete parse_record docstringMartin Czygan2019-12-141-0/+6
| * Update EntityImporter docstring.Martin Czygan2019-12-131-13/+4
* | add ingest import file collision protectionBryan Newbold2019-12-131-0/+6
* | fix spn kafka topic env varBryan Newbold2019-12-131-1/+1
* | update ingest request schemaBryan Newbold2019-12-135-16/+44
* | remove default mimetype from ingest-file importerBryan Newbold2019-12-131-2/+1
* | revert accidentally commited test timingBryan Newbold2019-12-131-2/+2
* | ensure importer description arg isn't clobberedBryan Newbold2019-12-123-5/+5
* | tweaks to ingest-file transformBryan Newbold2019-12-121-13/+7
* | initial 'Save Paper Now' web formBryan Newbold2019-12-127-2/+228
* | more auth token vars in example.envBryan Newbold2019-12-121-0/+6
* | savepapernow result importerBryan Newbold2019-12-123-4/+89
* | flush importer editgroups every few minutesBryan Newbold2019-12-121-5/+20
* | EntityImporter: submit (not accept) modeBryan Newbold2019-12-121-2/+14
|/
* Merge branch 'bnewbold-ingest-oa-container' into 'master'bnewbold2019-12-126-3/+181
|\
| * container_issnl, not issnl, for ES release queryBryan Newbold2019-12-121-1/+1
| * improve argparse usageBryan Newbold2019-12-111-6/+4
| * simplify ES scroll deletion using param()Bryan Newbold2019-12-111-29/+29
| * add ingest-container command (new CLI tool)Bryan Newbold2019-12-101-0/+136
| * factor out some basic kafka helpersBryan Newbold2019-12-102-0/+23
| * add another ingest request source to whitelistBryan Newbold2019-12-101-2/+5
| * pipenv: add elasticsearch and elasticsearch-dsl librariesBryan Newbold2019-12-102-1/+19
* | improve argparse usageBryan Newbold2019-12-1110-78/+95
|/
* fix delete release history viewBryan Newbold2019-12-091-1/+1
* regression test for deleted entity history viewBryan Newbold2019-12-091-0/+25
* add missing underline in deleted entity web viewBryan Newbold2019-12-091-1/+1
* add basic test for crossref harvest API callBryan Newbold2019-12-062-0/+46
* refactor kafka producer in crossref harvesterBryan Newbold2019-12-061-21/+26