summaryrefslogtreecommitdiffstats
path: root/python/fatcat_tools/importers/arabesque.py
Commit message (Collapse)AuthorAgeFilesLines
* importers: refactor imports of clean() and other normalization helpersBryan Newbold2021-11-101-1/+3
|
* typing: relatively simple type check fixesBryan Newbold2021-11-031-2/+2
| | | | | | | These mostly add new variable names so that existing variables aren't overwritten with a new type; delay coercing '{}' or '[]' to 'None' until the last minute; adding is-not-None checks to conditional clauses; and similar small changes.
* typing: initial annotations on importersBryan Newbold2021-11-031-5/+10
| | | | | This commit just adds the type annotations, doesn't do fixes to code to make type checking pass.
* fmt (black): fatcat_tools/Bryan Newbold2021-11-021-53/+60
|
* python: isort everythingBryan Newbold2021-11-021-1/+1
|
* arabesque import 'hit' field is 1/0, not true/falseBryan Newbold2021-11-021-2/+2
|
* lint: simple, safe inline lint fixesBryan Newbold2021-11-021-2/+2
| | | | '==' vs 'is'; 'not a in b' vs 'a not in b'; etc
* arabesque importer: ensure full 14-digit timestampsBryan Newbold2021-05-211-1/+3
|
* lint (flake8) tool python filesBryan Newbold2020-07-011-6/+1
|
* importers: control update behavior with more-standard flagBryan Newbold2020-01-061-1/+1
|
* ensure importer description arg isn't clobberedBryan Newbold2019-12-121-2/+1
|
* refactor duplicated b32_hex function in importersBryan Newbold2019-10-081-11/+1
|
* refactor all python source for client lib nameBryan Newbold2019-09-051-8/+8
|
* fix default mimetype (impacted pre-1923 files)Bryan Newbold2019-05-151-2/+2
|
* python implBryan Newbold2019-05-141-5/+6
|
* python implBryan Newbold2019-05-141-2/+2
|
* add limits to match importersBryan Newbold2019-04-231-1/+11
|
* arabesque importer does require timestamp/waybackBryan Newbold2019-04-221-0/+3
|
* handle API 400 in arabesque import (invalid extid)Bryan Newbold2019-04-191-7/+14
|
* fix arabesque importer crawl_id None bugBryan Newbold2019-04-181-1/+1
|
* mechanism to not double-update entitiesBryan Newbold2019-04-181-1/+6
|
* minor arabesque tweaksBryan Newbold2019-04-181-0/+2
|
* arabesque importer does fewer updatesBryan Newbold2019-04-181-1/+8
|
* arabesque importerBryan Newbold2019-04-181-0/+165