Diffstat (limited to 'notes/fatcat_sim_intersection.md')
-rw-r--r-- notes/fatcat_sim_intersection.md | 22
1 files changed, 22 insertions, 0 deletions
diff --git a/notes/fatcat_sim_intersection.md b/notes/fatcat_sim_intersection.md
new file mode 100644
index 0000000..43500ec
--- /dev/null
+++ b/notes/fatcat_sim_intersection.md
@@ -0,0 +1,22 @@
+
+investigate how many fatcat releases match to SIM:
+- dump archive.org SIM collection-level metadata
+- dump archive.org issue item-level metadata
+- count releases with: in_ia_sim, volume, issue, pages, year (month?)
+ => 22m in_ia_sim
+ => 1.1m in_ia_sim preservation:none
+ => 20m in_ia_sim volume
+ => 20m in_ia_sim volume year
+ => 19m in_ia_sim volume pages
+ => 5m in_ia_sim volume year date
+ => 7m in_ia_sim volume issue
+ => 7m in_ia_sim volume issue pages
+ => 6m in_ia_sim volume issue pages first_page
+ => 5.3m in_ia_sim volume issue pages first_page in_web:false
+ => 0.7m in_ia_sim volume issue pages first_page preservation:none
+ => 2.5m in_ia_sim volume issue pages first_page date
+- how many (any?) SIM journals with no fatcat container
+- how many SIM journals/issues/years with ~no fatcat releases
+
+at least some releases (e.g. release_jpruczlec5gsjpbc2cbvwedsdy) have updated
+Crossref metadata that now includes issue numbers
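
The filter combinations counted in the notes above could be expanded into search query strings along the following lines. This is only a sketch under assumptions the notes do not confirm: that a bare field name such as `volume` means "field is present" (written here as `volume:*` in Lucene-style query-string syntax), that `in_ia_sim` is a boolean flag, and that terms are combined with `AND`. The helper name `to_query` is hypothetical, and the field names are copied verbatim from the notes, so they may not match the actual index field names. Nothing here contacts a search service.

```python
# Hypothetical helper: expand the shorthand filter lists from the notes into
# Lucene-style query strings. The mapping rules are assumptions, not
# something the notes confirm.

# Assumed to be a boolean field; a bare mention is treated as "<field>:true".
BOOLEAN_FLAGS = {"in_ia_sim"}


def to_query(shorthand: str) -> str:
    """Turn e.g. 'in_ia_sim volume in_web:false' into a query string."""
    terms = []
    for token in shorthand.split():
        if ":" in token:
            # Already an explicit field:value pair, keep as written.
            terms.append(token)
        elif token in BOOLEAN_FLAGS:
            terms.append(f"{token}:true")
        else:
            # Assumption: a bare field name means "field is present".
            terms.append(f"{token}:*")
    return " AND ".join(terms)


# Shorthand filters copied from the notes (counts omitted).
FILTERS = [
    "in_ia_sim",
    "in_ia_sim preservation:none",
    "in_ia_sim volume",
    "in_ia_sim volume year",
    "in_ia_sim volume pages",
    "in_ia_sim volume year date",
    "in_ia_sim volume issue",
    "in_ia_sim volume issue pages",
    "in_ia_sim volume issue pages first_page",
    "in_ia_sim volume issue pages first_page in_web:false",
    "in_ia_sim volume issue pages first_page preservation:none",
    "in_ia_sim volume issue pages first_page date",
]

if __name__ == "__main__":
    for shorthand in FILTERS:
        print(to_query(shorthand))
```

Keeping the shorthand list separate from the expansion rules means the rules can be adjusted in one place if the real field names or the "field is present" syntax turn out to differ.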