aboutsummaryrefslogtreecommitdiffstats
path: root/extra/bulk_edits/2022-03-08_chocula.md
blob: 1877a236ba9eb82c0aa69e6bef310244d8914aeb (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31

Periodic import of chocula metadata updates.

## Prod Import

    date
    # Wed Mar  9 02:13:55 UTC 2022

    git log -n1
    # commit 72e3825893ae614fcd6c6ae8a513745bfefe36b2

    export FATCAT_AUTH_WORKER_JOURNAL_METADATA=[...]
    head -n100 /srv/fatcat/datasets/chocula_fatcat_export.2022-03-08.json | ./fatcat_import.py chocula --do-updates -
    # Counter({'total': 100, 'exists': 85, 'exists-skip-update': 85, 'update': 14, 'insert': 1, 'skip': 0})

Some of these are just "as of" date updates on DOAJ metadata, but most are
"good". Lots of KBART holding dates incremented by a year (to include 2022).

    time cat /srv/fatcat/datasets/chocula_fatcat_export.2022-03-08.json | ./fatcat_import.py chocula --do-updates -


    Counter({'total': 184950, 'exists': 151925, 'exists-skip-update': 151655, 'update': 29953, 'insert': 3072
    , 'exists-by-issnl': 270, 'skip': 0})

    real    11m7.011s
    user    4m48.705s
    sys     0m16.761s

Great!

Now update stats, following `extra/container_count_update/README.md`.