aboutsummaryrefslogtreecommitdiffstats
path: root/notes/fatcat_sim_intersection.md
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2020-06-04 14:42:33 -0700
committerBryan Newbold <bnewbold@archive.org>2020-06-04 14:42:33 -0700
commitf71715517c7d933859ef9a5c5df3929f78c7a93d (patch)
tree7347467957564d1835e3ae8175aae6c4680ed633 /notes/fatcat_sim_intersection.md
parent368a441618426595d451eadd8179f6fa8ecfe3e9 (diff)
downloadfatcat-scholar-f71715517c7d933859ef9a5c5df3929f78c7a93d.tar.gz
fatcat-scholar-f71715517c7d933859ef9a5c5df3929f78c7a93d.zip
add WIP notes to repo
Diffstat (limited to 'notes/fatcat_sim_intersection.md')
-rw-r--r--notes/fatcat_sim_intersection.md22
1 files changed, 22 insertions, 0 deletions
diff --git a/notes/fatcat_sim_intersection.md b/notes/fatcat_sim_intersection.md
new file mode 100644
index 0000000..43500ec
--- /dev/null
+++ b/notes/fatcat_sim_intersection.md
@@ -0,0 +1,22 @@
+
+investigate how many fatcat releases match to SIM:
+- dump archive.org SIM collection-level metadata
+- dump archive.org issue item-level metadata
+- releases with: in_sim, volume, issue, page, year (month?)
+ => 22m in_ia_sim
+ => 1.1m in_ia_sim preservation:none
+ => 20m in_ia_sim volume
+ => 20m in_ia_sim volume year
+ => 19m in_ia_sim volume pages
+ => 5m in_ia_sim volume year date
+ => 7m in_ia_sim volume issue
+ => 7m in_ia_sim volume issue pages
+ => 6m in_ia_sim volume issue pages first_page
+ => 5.3m in_ia_sim volume issue pages first_page in_web:false
+ => 0.7m in_ia_sim volume issue pages first_page preservation:none
+ => 2.5m in_ia_sim volume issue pages first_page date
+- how many (any?) SIM journals with no fatcat container
+- how many SIM journals/issues/years with ~no fatcat releases
+
+at least some (release_jpruczlec5gsjpbc2cbvwedsdy) have updated crossref
+metadata with issue numbers