aboutsummaryrefslogtreecommitdiffstats
Commit message (Collapse)AuthorAgeFilesLines
* more publisher_type pattern matchingBryan Newbold2022-07-212-7/+12
|
* more homepage domains to ignore (and resort)Bryan Newbold2022-07-211-28/+33
|
* Makefile: new export-fatcat helperBryan Newbold2022-07-061-0/+4
|
* update sources (new snapshot)Bryan Newbold2022-07-062-2/+5
|
* sources: 2022-03-08 snapshot (old)Bryan Newbold2022-04-181-6/+6
|
* CI: switch to ubuntu focalBryan Newbold2022-02-041-1/+1
|
* pipenv: update lock fileBryan Newbold2022-02-031-243/+215
|
* pipenv: black (code style tool) has stable release; add some more type ↵Bryan Newbold2022-02-031-4/+3
| | | | annotations
* in fatcat exports, skip 'UNKNOWN_TITLE'Bryan Newbold2021-11-301-0/+5
|
* update sourcesBryan Newbold2021-11-301-4/+4
|
* handle homepage check with no status (skip, etc)Bryan Newbold2021-11-301-1/+1
|
* make fmtBryan Newbold2021-11-303-27/+29
|
* simplify homepage URL handling code a bitBryan Newbold2021-11-301-12/+14
|
* improve homepage URL filteringBryan Newbold2021-11-301-14/+28
|
* move skip logic from Makefile to check_issn_urlsBryan Newbold2021-11-302-1/+19
|
* openalex test casesBryan Newbold2021-11-302-0/+48
|
* check_issn_urls: remove ProtocolError (doesn't exist?)Bryan Newbold2021-11-301-5/+1
|
* make: don't run lint as part of testBryan Newbold2021-11-301-1/+1
|
* check_issn_urls.py: yet more hacks in exceptionsBryan Newbold2021-11-241-15/+41
|
* make: homepage-status skip some large publisher domains for speedBryan Newbold2021-11-241-2/+2
|
* more HomepageUrl filteringBryan Newbold2021-11-241-0/+3
|
* codespell fix minor typos (there are some more in actual codeBryan Newbold2021-11-244-6/+6
|
* yet another possible connection errorBryan Newbold2021-11-231-0/+4
|
* sources.toml: container status updateBryan Newbold2021-11-231-1/+1
|
* openalex in sources.tomlBryan Newbold2021-11-221-0/+6
|
* add openalex directory sourceBryan Newbold2021-11-223-18/+77
| | | | | | Always run as day-specific ("TODAY") commands. Add timeouts so command actually completes reasonably.
* 2021-11-20 sources snapshot (upload in progress)Bryan Newbold2021-11-221-5/+6
|
* update TODOBryan Newbold2021-11-191-0/+4
|
* update sources for new snapshotBryan Newbold2021-05-281-2/+2
|
* update sourcesBryan Newbold2021-04-231-6/+6
|
* make fmtBryan Newbold2021-04-238-11/+28
|
* doaj: updates for new file format; removed some fields/metadataBryan Newbold2021-04-233-85/+77
|
* gitlab CI: small tweaksBryan Newbold2020-12-281-1/+3
|
* gitlab CI: reduce apt packages installedBryan Newbold2020-12-281-1/+1
|
* pipenv: lock pycountry to backwards-compatible versionBryan Newbold2020-12-281-1/+1
|
* update to python3.8Bryan Newbold2020-12-284-176/+211
|
* makefile: 'test' does not run 'lint' (doc fix)Bryan Newbold2020-12-281-1/+1
|
* missing homepages update (2020-10-13)Bryan Newbold2020-12-281-0/+13
|
* update TODOBryan Newbold2020-12-281-0/+20
|
* back-commit 2020-11-19 configBryan Newbold2020-12-071-6/+6
|
* SIM: cap maximum year of coverageBryan Newbold2020-12-071-0/+3
|
* update TODOBryan Newbold2020-10-081-8/+7
|
* sources: update metadata snapshotBryan Newbold2020-10-081-2/+2
|
* estimate coverage change from new coverage holdingsBryan Newbold2020-10-081-0/+22
|
* database support for scholarsportal and cariniana preservation holdingsBryan Newbold2020-10-087-1/+198
|
* make: entrez.txt, not entrez.csvBryan Newbold2020-10-081-2/+2
|
* vanished_inactive: more tolerant handling of unicode BOMBryan Newbold2020-10-081-1/+2
|
* basic ONIX XML-to-JSON converterBryan Newbold2020-10-081-0/+151
|
* fix typo in sourcesBryan Newbold2020-10-081-1/+1
|
* util: parse ISSN format with extra spacesBryan Newbold2020-09-131-0/+2
|