aboutsummaryrefslogtreecommitdiffstats
path: root/python/tests/test_ingest.py
Commit message (Expand)AuthorAgeFilesLines
* start handling trivial lint cleanups: unused imports, 'is None', etcBryan Newbold2021-10-261-5/+5
* make fmtBryan Newbold2021-10-261-55/+64
* python: isort all importsBryan Newbold2021-10-261-3/+4
* refactor and expand wall/block/cookie URL patternsBryan Newbold2021-09-031-0/+14
* check for simple URL patterns that are usually paywalls or loginwallsBryan Newbold2020-08-111-0/+18
* pdfextract support in ingest workerBryan Newbold2020-06-251-0/+7
* ingest: add URL blocklist featureBryan Newbold2020-01-171-0/+17
* clarify ingest result schema and semanticsBryan Newbold2020-01-151-3/+19
* add postgrest checks to test mocksBryan Newbold2020-01-141-1/+9
* tests: don't use localhost as a responses mock hostBryan Newbold2020-01-141-2/+2
* refactor ingest to a loop, allowing multiple hopsBryan Newbold2020-01-091-2/+9
* add ingest test fileBryan Newbold2020-01-091-0/+120