From f71715517c7d933859ef9a5c5df3929f78c7a93d Mon Sep 17 00:00:00 2001 From: Bryan Newbold Date: Thu, 4 Jun 2020 14:42:33 -0700 Subject: add WIP notes to repo --- notes/fatcat_sim_intersection.md | 22 ++++++++++++++++++++++ 1 file changed, 22 insertions(+) create mode 100644 notes/fatcat_sim_intersection.md (limited to 'notes/fatcat_sim_intersection.md') diff --git a/notes/fatcat_sim_intersection.md b/notes/fatcat_sim_intersection.md new file mode 100644 index 0000000..43500ec --- /dev/null +++ b/notes/fatcat_sim_intersection.md @@ -0,0 +1,22 @@ + +investigate how many fatcat releases match to SIM: +- dump archive.org SIM collection-level metadata +- dump archive.org issue item-level metadata +- releases with: in_sim, volume, issue, page, year (month?) + => 22m in_ia_sim + => 1.1m in_ia_sim preservation:none + => 20m in_ia_sim volume + => 20m in_ia_sim volume year + => 19m in_ia_sim volume pages + => 5m in_ia_sim volume year date + => 7m in_ia_sim volume issue + => 7m in_ia_sim volume issue pages + => 6m in_ia_sim volume issue pages first_page + => 5.3m in_ia_sim volume issue pages first_page in_web:false + => 0.7m in_ia_sim volume issue pages first_page preservation:none + => 2.5m in_ia_sim volume issue pages first_page date +- how many (any?) SIM journals with no fatcat container +- how many SIM journals/issues/years with ~no fatcat releases + +at least some (release_jpruczlec5gsjpbc2cbvwedsdy) have updated crossref +metadata with issue numbers -- cgit v1.2.3