aboutsummaryrefslogtreecommitdiffstats
Commit message (Expand)AuthorAgeFilesLines
* more publisher_type pattern matchingBryan Newbold2022-07-212-7/+12
* more homepage domains to ignore (and resort)Bryan Newbold2022-07-211-28/+33
* Makefile: new export-fatcat helperBryan Newbold2022-07-061-0/+4
* update sources (new snapshot)Bryan Newbold2022-07-062-2/+5
* sources: 2022-03-08 snapshot (old)Bryan Newbold2022-04-181-6/+6
* CI: switch to ubuntu focalBryan Newbold2022-02-041-1/+1
* pipenv: update lock fileBryan Newbold2022-02-031-243/+215
* pipenv: black (code style tool) has stable release; add some more type annota...Bryan Newbold2022-02-031-4/+3
* in fatcat exports, skip 'UNKNOWN_TITLE'Bryan Newbold2021-11-301-0/+5
* update sourcesBryan Newbold2021-11-301-4/+4
* handle homepage check with no status (skip, etc)Bryan Newbold2021-11-301-1/+1
* make fmtBryan Newbold2021-11-303-27/+29
* simplify homepage URL handling code a bitBryan Newbold2021-11-301-12/+14
* improve homepage URL filteringBryan Newbold2021-11-301-14/+28
* move skip logic from Makefile to check_issn_urlsBryan Newbold2021-11-302-1/+19
* openalex test casesBryan Newbold2021-11-302-0/+48
* check_issn_urls: remove ProtocolError (doesn't exist?)Bryan Newbold2021-11-301-5/+1
* make: don't run lint as part of testBryan Newbold2021-11-301-1/+1
* check_issn_urls.py: yet more hacks in exceptionsBryan Newbold2021-11-241-15/+41
* make: homepage-status skip some large publisher domains for speedBryan Newbold2021-11-241-2/+2
* more HomepageUrl filteringBryan Newbold2021-11-241-0/+3
* codespell fix minor typos (there are some more in actual codeBryan Newbold2021-11-244-6/+6
* yet another possible connection errorBryan Newbold2021-11-231-0/+4
* sources.toml: container status updateBryan Newbold2021-11-231-1/+1
* openalex in sources.tomlBryan Newbold2021-11-221-0/+6
* add openalex directory sourceBryan Newbold2021-11-223-18/+77
* 2021-11-20 sources snapshot (upload in progress)Bryan Newbold2021-11-221-5/+6
* update TODOBryan Newbold2021-11-191-0/+4
* update sources for new snapshotBryan Newbold2021-05-281-2/+2
* update sourcesBryan Newbold2021-04-231-6/+6
* make fmtBryan Newbold2021-04-238-11/+28
* doaj: updates for new file format; removed some fields/metadataBryan Newbold2021-04-233-85/+77
* gitlab CI: small tweaksBryan Newbold2020-12-281-1/+3
* gitlab CI: reduce apt packages installedBryan Newbold2020-12-281-1/+1
* pipenv: lock pycountry to backwards-compatible versionBryan Newbold2020-12-281-1/+1
* update to python3.8Bryan Newbold2020-12-284-176/+211
* makefile: 'test' does not run 'lint' (doc fix)Bryan Newbold2020-12-281-1/+1
* missing homepages update (2020-10-13)Bryan Newbold2020-12-281-0/+13
* update TODOBryan Newbold2020-12-281-0/+20
* back-commit 2020-11-19 configBryan Newbold2020-12-071-6/+6
* SIM: cap maximum year of coverageBryan Newbold2020-12-071-0/+3
* update TODOBryan Newbold2020-10-081-8/+7
* sources: update metadata snapshotBryan Newbold2020-10-081-2/+2
* estimate coverage change from new coverage holdingsBryan Newbold2020-10-081-0/+22
* database support for scholarsportal and cariniana preservation holdingsBryan Newbold2020-10-087-1/+198
* make: entrez.txt, not entrez.csvBryan Newbold2020-10-081-2/+2
* vanished_inactive: more tolerant handling of unicode BOMBryan Newbold2020-10-081-1/+2
* basic ONIX XML-to-JSON converterBryan Newbold2020-10-081-0/+151
* fix typo in sourcesBryan Newbold2020-10-081-1/+1
* util: parse ISSN format with extra spacesBryan Newbold2020-09-131-0/+2