author Bryan Newbold <bnewbold@robocracy.org> 2020-01-19 09:58:06 -0800
committer Bryan Newbold <bnewbold@robocracy.org> 2020-01-19 09:58:18 -0800
commit 5e24cdf19644b7f821b372083f5c616ba47ec6c5
tree 8dc9c8c1a121861cfac28423d112799b5bec198c /notes/bulk_edits/CHANGELOG.md
parent 0bdd96ced29b86cb15133b27038301bc9eecef30
basic notes in bulk edit changelog
Diffstat (limited to 'notes/bulk_edits/CHANGELOG.md')
-rw-r--r-- notes/bulk_edits/CHANGELOG.md 7
1 files changed, 7 insertions, 0 deletions
diff --git a/notes/bulk_edits/CHANGELOG.md b/notes/bulk_edits/CHANGELOG.md
index 2db0c72d..172528da 100644
--- a/notes/bulk_edits/CHANGELOG.md
+++ b/notes/bulk_edits/CHANGELOG.md
@@ -14,6 +14,13 @@ This file should not turn in to a TODO list!
Imported around 2,500 new containers (journals, by ISSN-L) from chocula
analysis script.
+Imported DOIs from Datacite (around 16 million, plus or minus a couple
+million).
+
+Imported new release entities from 2020 Pubmed/MEDLINE baseline. This import
+included only new Pubmed works cataloged in 2019 (up until December or so).
+Only a few hundred thousand new release entities.
+
## 2019-12
Started continuous harvesting Datacite DOI metadata; first date harvested was