author | Bryan Newbold <bnewbold@archive.org> | 2020-06-04 14:42:33 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2020-06-04 14:42:33 -0700 |
commit | f71715517c7d933859ef9a5c5df3929f78c7a93d (patch) | |
tree | 7347467957564d1835e3ae8175aae6c4680ed633 /notes/fatcat_sim_intersection.md | |
parent | 368a441618426595d451eadd8179f6fa8ecfe3e9 (diff) | |
add WIP notes to repo
Diffstat (limited to 'notes/fatcat_sim_intersection.md')
-rw-r--r-- | notes/fatcat_sim_intersection.md | 22 |
1 files changed, 22 insertions, 0 deletions
diff --git a/notes/fatcat_sim_intersection.md b/notes/fatcat_sim_intersection.md new file mode 100644 index 0000000..43500ec --- /dev/null +++ b/notes/fatcat_sim_intersection.md @@ -0,0 +1,22 @@ + +investigate how many fatcat releases match to SIM: +- dump archive.org SIM collection-level metadata +- dump archive.org issue item-level metadata +- releases with: in_sim, volume, issue, page, year (month?) + => 22m in_ia_sim + => 1.1m in_ia_sim preservation:none + => 20m in_ia_sim volume + => 20m in_ia_sim volume year + => 19m in_ia_sim volume pages + => 5m in_ia_sim volume year date + => 7m in_ia_sim volume issue + => 7m in_ia_sim volume issue pages + => 6m in_ia_sim volume issue pages first_page + => 5.3m in_ia_sim volume issue pages first_page in_web:false + => 0.7m in_ia_sim volume issue pages first_page preservation:none + => 2.5m in_ia_sim volume issue pages first_page date +- how many (any?) SIM journals with no fatcat container +- how many SIM journals/issues/years with ~no fatcat releases + +at least some (release_jpruczlec5gsjpbc2cbvwedsdy) have updated crossref +metadata with issue numbers |