aboutsummaryrefslogtreecommitdiffstats
Commit message (Expand)AuthorAgeFilesLines
* gitlab-ci: copy env var in to place for testsBryan Newbold2021-10-271-0/+1
* fix type annotations for petabox body fetch helperBryan Newbold2021-10-265-8/+11
* small type annotation hackBryan Newbold2021-10-261-1/+1
* fileset: fix field renaming bug (caught by mypy)Bryan Newbold2021-10-261-2/+2
* fileset ingest: fix table name typo (via mypy)Bryan Newbold2021-10-261-1/+1
* update 'XXX' notes from fileset ingest developmentBryan Newbold2021-10-262-9/+6
* bugfix: setting html_biblio on ingest resultsBryan Newbold2021-10-262-2/+2
* lint collection membership (last lint for now)Bryan Newbold2021-10-267-32/+32
* commit updated flake8 lint configurationBryan Newbold2021-10-261-6/+10
* ingest fileset: fix silly import typoBryan Newbold2021-10-261-1/+1
* type annotations for persist workers; required some workBryan Newbold2021-10-261-66/+59
* ingest file HTTP API: fixes from type checkingBryan Newbold2021-10-261-3/+3
* more progress on type annotationsBryan Newbold2021-10-268-34/+55
* grobid: fix a bug with consolidate_mode header, exposed by type annotationsBryan Newbold2021-10-261-1/+2
* grobid: type annotationsBryan Newbold2021-10-261-9/+19
* type annotations on SandcrawlerWorkerBryan Newbold2021-10-261-46/+57
* more progress on type annotations and lintingBryan Newbold2021-10-2611-55/+87
* live tests: FTP wayback replay now returns 200, not 226Bryan Newbold2021-10-261-2/+2
* ia: more tweaks to delicate code to satisfy type checkerBryan Newbold2021-10-261-10/+12
* ia helpers: enforce max_redirects count correctlyBryan Newbold2021-10-261-1/+1
* set CDX request params are str, not int or datetimeBryan Newbold2021-10-261-3/+6
* bugfix: was setting 'from' parameter as a tuple, not a stringBryan Newbold2021-10-261-1/+1
* start type annotating IA helper codeBryan Newbold2021-10-261-37/+65
* start adding python type annotations to db and persist codeBryan Newbold2021-10-262-97/+124
* Makefile: don't fail on isort error (consider these minor)Bryan Newbold2021-10-261-1/+1
* tweak flake8 configBryan Newbold2021-10-261-2/+11
* flake8 clean (with current settings)Bryan Newbold2021-10-269-25/+24
* pipenv: import type annotations for requests and dateparserBryan Newbold2021-10-262-1/+19
* start handling trivial lint cleanups: unused imports, 'is None', etcBryan Newbold2021-10-2630-149/+86
* make fmtBryan Newbold2021-10-2659-1225/+1582
* tweak lint/fmt settingsBryan Newbold2021-10-262-4/+6
* update pytest warning filters (they are pretty expansive)Bryan Newbold2021-10-261-0/+3
* ingest_html: update trafilatura TEI-XML output kwargBryan Newbold2021-10-261-1/+1
* python: isort all importsBryan Newbold2021-10-2657-178/+207
* add pyproject.toml (for isort and yapf config), and update 'lint' and 'fmt' m...Bryan Newbold2021-10-262-3/+13
* pipenv: general update; add isort, yapf (over black), grobid_tei_xmlBryan Newbold2021-10-262-730/+880
* kafka monitoring commandsBryan Newbold2021-10-261-0/+4
* more small fileset ingest tweaksBryan Newbold2021-10-262-6/+21
* commit SPN account changesBryan Newbold2021-10-151-0/+14
* commit old ingest domain summaryBryan Newbold2021-10-151-0/+345
* python: more aggressive gitignoreBryan Newbold2021-10-151-0/+3
* persist support for ingest platform table, using existing persist workerBryan Newbold2021-10-153-4/+131
* sql fileset ingest table iterationBryan Newbold2021-10-151-12/+11
* document passing back platform_base_urlBryan Newbold2021-10-151-0/+1
* improve fileset ingest integration with file ingestBryan Newbold2021-10-154-5/+25
* more fileset iterationBryan Newbold2021-10-155-45/+81
* move SPNv2 'simple_get' logic to SPN clientBryan Newbold2021-10-153-52/+31
* filesets: iteration of implementation and docsBryan Newbold2021-10-155-96/+167
* updates to fileset ingest proposalBryan Newbold2021-10-152-239/+337
* fileset ingest notesBryan Newbold2021-10-151-3/+23