diff options
author | Bryan Newbold <bnewbold@robocracy.org> | 2022-03-22 13:18:21 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@robocracy.org> | 2022-03-22 13:18:21 -0700 |
commit | 51d754c3fd0cbabb2e81195e2cf70384ed36dad8 (patch) | |
tree | 5df20724fd87e4c8f21d7a5aed5f67a7caddab89 /extra/bulk_edits/2022-03-08_chocula.md | |
parent | 91e4cedb00d1d2a5003f331880290a6e600ee6b5 (diff) | |
download | fatcat-51d754c3fd0cbabb2e81195e2cf70384ed36dad8.tar.gz fatcat-51d754c3fd0cbabb2e81195e2cf70384ed36dad8.zip |
document recent bulk metadata edits/imports
Diffstat (limited to 'extra/bulk_edits/2022-03-08_chocula.md')
-rw-r--r-- | extra/bulk_edits/2022-03-08_chocula.md | 31 |
1 files changed, 31 insertions, 0 deletions
diff --git a/extra/bulk_edits/2022-03-08_chocula.md b/extra/bulk_edits/2022-03-08_chocula.md new file mode 100644 index 00000000..1877a236 --- /dev/null +++ b/extra/bulk_edits/2022-03-08_chocula.md @@ -0,0 +1,31 @@ + +Periodic import of chocula metadata updates. + +## Prod Import + + date + # Wed Mar 9 02:13:55 UTC 2022 + + git log -n1 + # commit 72e3825893ae614fcd6c6ae8a513745bfefe36b2 + + export FATCAT_AUTH_WORKER_JOURNAL_METADATA=[...] + head -n100 /srv/fatcat/datasets/chocula_fatcat_export.2022-03-08.json | ./fatcat_import.py chocula --do-updates - + # Counter({'total': 100, 'exists': 85, 'exists-skip-update': 85, 'update': 14, 'insert': 1, 'skip': 0}) + +Some of these are just "as of" date updates on DOAJ metadata, but most are +"good". Lots of KBART holding dates incremented by a year (to include 2022). + + time cat /srv/fatcat/datasets/chocula_fatcat_export.2022-03-08.json | ./fatcat_import.py chocula --do-updates - + + + Counter({'total': 184950, 'exists': 151925, 'exists-skip-update': 151655, 'update': 29953, 'insert': 3072 + , 'exists-by-issnl': 270, 'skip': 0}) + + real 11m7.011s + user 4m48.705s + sys 0m16.761s + +Great! + +Now update stats, following `extra/container_count_update/README.md`. |