aboutsummaryrefslogtreecommitdiffstats
path: root/python/tests/test_ingest.py
Commit message (Collapse)AuthorAgeFilesLines
* make fmt (black 21.9b0)Bryan Newbold2021-10-271-126/+147
|
* more progress on type annotations and lintingBryan Newbold2021-10-261-1/+1
|
* start handling trivial lint cleanups: unused imports, 'is None', etcBryan Newbold2021-10-261-5/+5
|
* make fmtBryan Newbold2021-10-261-55/+64
|
* python: isort all importsBryan Newbold2021-10-261-3/+4
|
* refactor and expand wall/block/cookie URL patternsBryan Newbold2021-09-031-0/+14
|
* check for simple URL patterns that are usually paywalls or loginwallsBryan Newbold2020-08-111-0/+18
|
* pdfextract support in ingest workerBryan Newbold2020-06-251-0/+7
|
* ingest: add URL blocklist featureBryan Newbold2020-01-171-0/+17
| | | | And, temporarily, block zenodo and figshare.
* clarify ingest result schema and semanticsBryan Newbold2020-01-151-3/+19
|
* add postgrest checks to test mocksBryan Newbold2020-01-141-1/+9
|
* tests: don't use localhost as a responses mock hostBryan Newbold2020-01-141-2/+2
|
* refactor ingest to a loop, allowing multiple hopsBryan Newbold2020-01-091-2/+9
|
* add ingest test fileBryan Newbold2020-01-091-0/+120
Forgot to commit earlier!