From 043b35040e4385c674267aa88c4056bdfdd9cb6c Mon Sep 17 00:00:00 2001 From: Bryan Newbold Date: Thu, 3 Sep 2020 18:27:44 -0700 Subject: update notes and explore --- TODO.md | 16 +++++++++++++++- 1 file changed, 15 insertions(+), 1 deletion(-) (limited to 'TODO.md') diff --git a/TODO.md b/TODO.md index 29b1fe0..befbd48 100644 --- a/TODO.md +++ b/TODO.md @@ -1,4 +1,5 @@ + priorities: - coverage stats, particularly for longtail - `is_active` coverage @@ -10,9 +11,22 @@ priorities: ## Sources -- unpaywall journal-level classification +- preservation coverage + x hathitrust (huge!) + https://www.hathitrust.org/hathifiles_description + x PKP PLN (ONIX) + https://pkp.sfu.ca/pkp-pn/ + http://pkp.sfu.ca/files/pkppn/onix.csv + => Scholars Portal (canada) + received ONIX XML, hoping for KBART format + => Cariniana + => National Digital Preservation Program, China + => Library of Congress +- additional hathitrust (many more ISSNs/journals) +- unpaywall journal-level classification (OA color) => ask for journal-level dump or do munging - jurn matches + => somebody on github did an openrefine match - public scopus list (?) - scrape/munge public clarivate dumps - repositories (?) -- cgit v1.2.3