aboutsummaryrefslogtreecommitdiffstats
path: root/extra/bulk_edits/2022-07-06_chocula.md
blob: 86bf36fbfe5fd6794eff44b9cebfa909685722b0 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25

Periodic import of chocula metadata updates.

## Prod Import

    date
    # Wed Jul  6 23:29:47 UTC 2022

    git log -n1
    # aff3f40a5177dd6de4eee8ea7bca78df7a595bf3

    export FATCAT_AUTH_WORKER_JOURNAL_METADATA=[...]
    head -n100 /srv/fatcat/datasets/chocula_fatcat_export.2022-07-06.json | ./fatcat_import.py chocula --do-updates -
    # Counter({'total': 100, 'exists': 86, 'exists-skip-update': 83, 'update': 13, 'exists-by-issnl': 3, 'insert': 1, 'skip': 0})

Many updates are just KBART holding dates or DOAJ as-of dates, but that is fine
and expected.

    time cat /srv/fatcat/datasets/chocula_fatcat_export.2022-07-06.json | ./fatcat_import.py chocula --do-updates -
    # Counter({'total': 187480, 'exists': 155943, 'exists-skip-update': 151171, 'update': 30437, 'exists-by-issnl': 4772, 'insert': 1100, 'skip': 0})
    # real    10m28.081s
    # user    4m37.447s
    # sys     0m16.063s

Now update stats, following `extra/container_count_update/README.md`.