aboutsummaryrefslogtreecommitdiffstats
path: root/python/fatcat_tools/importers/doaj_article.py
Commit message (Collapse)AuthorAgeFilesLines
* add fuzzy match filtering to DOAJ importerBryan Newbold2020-12-161-2/+9
| | | | | | | | | | | In this default configuration, any entities with a fuzzy match (even "ambiguous") will be skipped at import time, to prevent creating duplicates. This is conservative towards not creating new/duplicate entities. In the future, as we get more confidence in fuzzy match/verification, we can start to ignore AMBIGUOUS, handle EXACT as same release, and merge STRONG (and WEAK?) matches under the same work entity.
* doaj: fix update code path (getattr not __dict__)Bryan Newbold2020-11-201-4/+3
| | | | Also add missing code coverage for update path (disabled by default).
* DOAJ: handle empty identifier 'id' caseBryan Newbold2020-11-201-0/+2
|
* tweak DOAJ importer class args and default for do_updatesBryan Newbold2020-11-191-2/+2
|
* implement remainder of DOAJ article importerBryan Newbold2020-11-191-57/+125
|
* initial implementation of DOAJ importerBryan Newbold2020-11-191-0/+289
Several things to finish implementing and polish.