summaryrefslogtreecommitdiffstats
path: root/extra/bulk_edits/CHANGELOG.md
diff options
context:
space:
mode:
Diffstat (limited to 'extra/bulk_edits/CHANGELOG.md')
-rw-r--r--extra/bulk_edits/CHANGELOG.md42
1 files changed, 42 insertions, 0 deletions
diff --git a/extra/bulk_edits/CHANGELOG.md b/extra/bulk_edits/CHANGELOG.md
index 278dc1d8..716c95d6 100644
--- a/extra/bulk_edits/CHANGELOG.md
+++ b/extra/bulk_edits/CHANGELOG.md
@@ -9,6 +9,48 @@ this file should probably get merged into the guide at some point.
This file should not turn in to a TODO list!
+## 2022-07
+
+Ran a journal-level metadata update, using chocula.
+
+Cleaned up just under 500 releases with missing `container_id` from an older
+DOAJ article import.
+
+Imported roughly 100k releases from DOAJ, new since 2022-04.
+
+Imported roughly 2.7 million new ORCiD `creator` entities, using the 2021 dump
+(first update since 2020 dump).
+
+Imported almost 1 million new DOI release entities from JALC, first update in
+more than a year.
+
+Imported at least 400 new dblp containers, and an unknown number of new dblp
+release entities.
+
+Cleaned up about a thousand containers with incorrect `publisher_type`, based
+on current publisher name. Further updates will populate after the next chocula
+import.
+
+Ran a second batch of journal-level metadata updates, from chocula, resulting
+in a couple thousand updated entities.
+
+
+## 2022-04
+
+Imported some initial fileset entities.
+
+Updated about 25k file entities from isiarticles.com, which are samples (spam
+for translation service) to remove release linkage and set
+`content_scope=sample` (similar to the springer "page one" case).
+
+## 2022-03
+
+Ran a journal-level metadata update, using chocula.
+
+Run a DOAJ article-level metadata import, yielding a couple hundred thousand
+new release entities. Crawling and bulk ingest of HTML and PDF fulltext for
+these articles also started.
+
## 2022-02
- removed `container_id` linkage for some Datacite DOI releases which are