aboutsummaryrefslogtreecommitdiffstats
Commit message (Collapse)AuthorAgeFilesLines
...
* live tests: FTP wayback replay now returns 200, not 226Bryan Newbold2021-10-261-2/+2
|
* ia: more tweaks to delicate code to satisfy type checkerBryan Newbold2021-10-261-10/+12
| | | | | Ran the 'live' wayback tests after this commit as a check, and worked (once FTP status code behavior change is fixed)
* ia helpers: enforce max_redirects count correctlyBryan Newbold2021-10-261-1/+1
| | | | | AKA, should run fetch even if max_redirects = 0; the first loop iteration is not a redirect.
* set CDX request params are str, not int or datetimeBryan Newbold2021-10-261-3/+6
| | | | This might be a bugfix, changing CDX lookup behavior?
* bugfix: was setting 'from' parameter as a tuple, not a stringBryan Newbold2021-10-261-1/+1
|
* start type annotating IA helper codeBryan Newbold2021-10-261-37/+65
|
* start adding python type annotations to db and persist codeBryan Newbold2021-10-262-97/+124
|
* Makefile: don't fail on isort error (consider these minor)Bryan Newbold2021-10-261-1/+1
|
* tweak flake8 configBryan Newbold2021-10-261-2/+11
|
* flake8 clean (with current settings)Bryan Newbold2021-10-269-25/+24
|
* pipenv: import type annotations for requests and dateparserBryan Newbold2021-10-262-1/+19
|
* start handling trivial lint cleanups: unused imports, 'is None', etcBryan Newbold2021-10-2630-149/+86
|
* make fmtBryan Newbold2021-10-2659-1225/+1582
|
* tweak lint/fmt settingsBryan Newbold2021-10-262-4/+6
|
* update pytest warning filters (they are pretty expansive)Bryan Newbold2021-10-261-0/+3
|
* ingest_html: update trafilatura TEI-XML output kwargBryan Newbold2021-10-261-1/+1
|
* python: isort all importsBryan Newbold2021-10-2657-178/+207
|
* add pyproject.toml (for isort and yapf config), and update 'lint' and 'fmt' ↵Bryan Newbold2021-10-262-3/+13
| | | | make targets
* pipenv: general update; add isort, yapf (over black), grobid_tei_xmlBryan Newbold2021-10-262-730/+880
|
* kafka monitoring commandsBryan Newbold2021-10-261-0/+4
|
* more small fileset ingest tweaksBryan Newbold2021-10-262-6/+21
|
* commit SPN account changesBryan Newbold2021-10-151-0/+14
|
* commit old ingest domain summaryBryan Newbold2021-10-151-0/+345
|
* python: more aggressive gitignoreBryan Newbold2021-10-151-0/+3
|
* persist support for ingest platform table, using existing persist workerBryan Newbold2021-10-153-4/+131
|
* sql fileset ingest table iterationBryan Newbold2021-10-151-12/+11
|
* document passing back platform_base_urlBryan Newbold2021-10-151-0/+1
|
* improve fileset ingest integration with file ingestBryan Newbold2021-10-154-5/+25
|
* more fileset iterationBryan Newbold2021-10-155-45/+81
|
* move SPNv2 'simple_get' logic to SPN clientBryan Newbold2021-10-153-52/+31
|
* filesets: iteration of implementation and docsBryan Newbold2021-10-155-96/+167
|
* updates to fileset ingest proposalBryan Newbold2021-10-152-239/+337
|
* fileset ingest notesBryan Newbold2021-10-151-3/+23
|
* fileset ingest: improve platform parsingBryan Newbold2021-10-151-12/+196
|
* fileset ingest: improve error handlingBryan Newbold2021-10-154-48/+106
|
* initial implementation of zenodo platform importBryan Newbold2021-10-151-0/+100
|
* initial figshare platform helperBryan Newbold2021-10-151-0/+95
|
* improvements to platform helpersBryan Newbold2021-10-153-34/+44
|
* component ingest support for dataverse files (individual)Bryan Newbold2021-10-152-13/+31
|
* progress on web ingest strategyBryan Newbold2021-10-153-12/+121
|
* fileset ingest progress for dataverseBryan Newbold2021-10-154-23/+291
|
* local-file version of gen_file_metadataBryan Newbold2021-10-153-3/+56
|
* progress on dataset ingestBryan Newbold2021-10-154-122/+333
|
* dataset ingest: start enumerating examplesBryan Newbold2021-10-151-0/+34
|
* ingest tool: always require ingest type as part of 'single' commandBryan Newbold2021-10-151-3/+3
|
* wrap up previous renaming workBryan Newbold2021-10-154-6/+4
|
* progress on fileset/dataset ingestBryan Newbold2021-10-154-0/+403
|
* scripts: example archiveorg-to-fileset importerBryan Newbold2021-10-151-0/+138
|
* initial dataset/fileset ingest proposalBryan Newbold2021-10-151-0/+185
|
* sql: initial ingest fileset tableBryan Newbold2021-10-151-0/+38
|