summaryrefslogtreecommitdiffstats
path: root/python/fatcat_tools/importers/matched.py
Commit message (Collapse)AuthorAgeFilesLines
* typing: relatively simple type check fixesBryan Newbold2021-11-031-7/+7
| | | | | | | These mostly add new variable names so that existing variables aren't overwritten with a new type; delay coercing '{}' or '[]' to 'None' until the last minute; adding is-not-None checks to conditional clauses; and similar small changes.
* typing: initial annotations on importersBryan Newbold2021-11-031-6/+9
| | | | | This commit just adds the type annotations, doesn't do fixes to code to make type checking pass.
* fmt (black): fatcat_tools/Bryan Newbold2021-11-021-45/+58
|
* lint: simple, safe inline lint fixesBryan Newbold2021-11-021-2/+2
| | | | '==' vs 'is'; 'not a in b' vs 'a not in b'; etc
* lint/fmt: remove all 'import *'Bryan Newbold2021-11-021-2/+3
|
* lint (flake8) tool python filesBryan Newbold2020-07-011-7/+1
|
* ensure importer description arg isn't clobberedBryan Newbold2019-12-121-2/+1
|
* refactor all python source for client lib nameBryan Newbold2019-09-051-9/+9
|
* add inflight edit protection to matched importerBryan Newbold2019-06-241-1/+8
|
* fix typo; do arxiv-specific match import hackBryan Newbold2019-06-241-3/+14
|
* fix syntax in existing.url cleanupBryan Newbold2019-06-241-1/+1
|
* fix existing updaterBryan Newbold2019-06-241-2/+3
|
* add minimal file URL cleanups to matched importerBryan Newbold2019-06-241-0/+8
|
* matched importer: urls, not urlBryan Newbold2019-06-241-1/+1
| | | | | | This matches the docs in the header. Previous matched imports were using 'cdx' objects with no 'dt' key, but this makes more sense. As far as I know the old 'url' code path was never actually used (or tested, derp).
* support extids in matched importerBryan Newbold2019-06-201-3/+26
|
* fix default mimetype (impacted pre-1923 files)Bryan Newbold2019-05-151-2/+7
|
* python implBryan Newbold2019-05-141-5/+6
|
* python implBryan Newbold2019-05-141-2/+2
|
* add limits to match importersBryan Newbold2019-04-231-1/+13
|
* editgroup description overrideBryan Newbold2019-04-221-2/+2
|
* matched importer shouldn't require waybackBryan Newbold2019-04-221-5/+7
|
* don't print missing DOIs, just countBryan Newbold2019-02-051-1/+3
|
* pass through kwargs (fixes bezerk imports)Bryan Newbold2019-01-291-1/+2
|
* fix rel/url order swapBryan Newbold2019-01-281-1/+1
|
* refactor _get_editgroup => get_editgroup_idBryan Newbold2019-01-241-1/+1
|
* refactor make_rel_urlBryan Newbold2019-01-241-14/+3
|
* importer bugfixesBryan Newbold2019-01-231-2/+6
|
* matched importer: bezerk mode to skip file updatesBryan Newbold2019-01-231-11/+5
|
* ftfy all over (needs Pipfile.lock)Bryan Newbold2019-01-231-1/+1
|
* refactor remaining importersBryan Newbold2019-01-221-79/+71
|
* more 'true' -> True query param fixesBryan Newbold2019-01-181-1/+1
|
* use full-on autoaccept modeBryan Newbold2019-01-111-4/+7
| | | | | | | | Now that editor_id is infered from token, don't *need* to create ahead of time. This backend change simplifies things greatly (either update an existing editgroup, or create new and *only* include entities in the batch transaction), at the cost of being able to configure the editgroup in any way, including setting a description.
* importers and tests all use new api-passingBryan Newbold2019-01-081-6/+12
|
* python impl of API ident harmonizationBryan Newbold2018-12-241-10/+10
|
* crossref importer checks for existing DOIsBryan Newbold2018-11-211-3/+3
|
* large refactor of python names/pathsBryan Newbold2018-11-151-2/+8
| | | | | | | - Add __init__.py files for fatcat_tools submodules, and use them in imports - Add a bunch of comments to files. - rename a number of classes and functions to be less verbose
* use Counter object instead of per-metric intsBryan Newbold2018-11-131-4/+4
|
* shuffle around fatcat_tools layoutBryan Newbold2018-11-131-0/+144