aboutsummaryrefslogtreecommitdiffstats
Commit message (Expand)AuthorAgeFilesLines
...
| * | cleanups: tweaks to wayback CDX cleanup scriptsBryan Newbold2021-11-092-6/+21
| * | cleanups: initial lowercase DOI cleanup scriptBryan Newbold2021-11-091-0/+145
| * | wayback short ts: another regression test, and some small fmt/tweaksBryan Newbold2021-11-091-3/+38
| * | wayback cleanup: actually update entityBryan Newbold2021-11-091-2/+4
| * | imports: generic file cleanup removes exact duplicate URLsBryan Newbold2021-11-091-0/+9
| * | wayback short ts: add regression test for dupe URLsBryan Newbold2021-11-091-0/+44
| * | short wayback ts: initial cleanup script implementationBryan Newbold2021-11-091-0/+251
| * | wayback timestamps: updates to handle 4-digit caseBryan Newbold2021-11-092-11/+108
| * | start work on wayback short-timestamp cleanupBryan Newbold2021-11-092-0/+238
| |/
* | update crawlability docsBryan Newbold2021-11-101-1/+9
* | sitemap generation improvementsBryan Newbold2021-11-102-1/+2
* | start notes/proposal about 'crawlability' improvementsBryan Newbold2021-11-101-0/+68
* | pubmed: allow updates if PMCID does not exist yetBryan Newbold2021-11-101-1/+6
|/
* update CHANGELOG for recent developmentBryan Newbold2021-11-051-0/+26
* python tests: verify array sort orderBryan Newbold2021-11-054-20/+18
* api: add SQL 'ORDER BY' to many reads to stabilize API array orderingBryan Newbold2021-11-051-3/+14
* enable type annotation checking with flake8 by default ('make lint')Bryan Newbold2021-11-031-4/+2
* cleanups: create a separate JsonLinePusher for cleanup workers (distinct base...Bryan Newbold2021-11-033-4/+20
* facat_import.py: work around corner case in run_cdl_dash_dat()Bryan Newbold2021-11-031-1/+1
* datacite importer: remove unused 'year_only' variableBryan Newbold2021-11-031-2/+3
* web: work around remaining type annotation issuesBryan Newbold2021-11-032-11/+15
* ignore type errors in cors.py (third party code)Bryan Newbold2021-11-031-2/+2
* web: fix bytes/text warning loggingBryan Newbold2021-11-031-3/+3
* lint: remove unused importBryan Newbold2021-11-031-1/+0
* web config: add helper for coercing env vars into booleansBryan Newbold2021-11-031-3/+32
* web: add type annotationsBryan Newbold2021-11-0312-297/+347
* introduce 'AnyResponse' type for Flask viewsBryan Newbold2021-11-032-0/+15
* pubmed harvester: remove unused variablesBryan Newbold2021-11-031-2/+2
* pubmed harvester: explicit assertions to mark unreachable code pathsBryan Newbold2021-11-031-0/+2
* typing: add assertions to fatcat_tool code to make type assumptions explicitBryan Newbold2021-11-033-0/+3
* typing: add annotations to remaining fatcat_tools codeBryan Newbold2021-11-039-122/+186
* datacite: add comment about potential date parsing bugBryan Newbold2021-11-031-0/+1
* datacite importer: dateparser.date.DateDataParser()Bryan Newbold2021-11-031-1/+1
* more involved type wrangling and fixes for importersBryan Newbold2021-11-033-12/+14
* typing: relatively simple type check fixesBryan Newbold2021-11-0314-87/+82
* typing: initial annotations on importersBryan Newbold2021-11-0322-274/+443
* typing: first batch of python bulk type annotationsBryan Newbold2021-11-0321-139/+200
* importers: remove unused __main__ routineBryan Newbold2021-11-034-19/+0
* lint: resolve existing mypy type errorsBryan Newbold2021-11-0212-81/+125
* web: annotate 'app' as 'Any', and document whyBryan Newbold2021-11-021-1/+5
* flake8 config: update ignore list after big fmtBryan Newbold2021-11-021-3/+4
* re-fix some lint issues after big 'fmt'Bryan Newbold2021-11-022-4/+5
* fmt (black): fatcat_tools/Bryan Newbold2021-11-0243-3194/+4020
* fmt (black): fatcat_web/Bryan Newbold2021-11-0213-1297/+1992
* fmt (black): *.pyBryan Newbold2021-11-0211-715/+1110
* fmt (black): tests/Bryan Newbold2021-11-0255-1430/+1852
* python: isort everythingBryan Newbold2021-11-0285-184/+278
* revert '-vv' for pytest in Makefile (too verbose)Bryan Newbold2021-11-021-1/+1
* arabesque import 'hit' field is 1/0, not true/falseBryan Newbold2021-11-021-2/+2
* lint: simple, safe inline lint fixesBryan Newbold2021-11-0239-221/+220