path: root/README.md
author Bryan Newbold <bnewbold@archive.org> 2020-06-29 22:23:16 -0700
committer Bryan Newbold <bnewbold@archive.org> 2020-06-29 22:23:16 -0700
commit 16e5f6d279c34856d542da98b042145c8995064f
tree 0e874c1a5154bb09077e3f93c1094aef983907e7 /README.md
parent ddf41a84f6cde6d5489a291a39f026e7c2672b87
update COVID-19 ingest for refactors
Diffstat (limited to 'README.md')
-rw-r--r-- README.md 4
1 file changed, 2 insertions, 2 deletions
diff --git a/README.md b/README.md
index 84f0722..02ddf7d 100644
--- a/README.md
+++ b/README.md
@@ -54,10 +54,10 @@ Fetch "heavy" fulltext documents (JSON) for full SIM database:
Re-use existing COVID-19 database to index releases:
- cat /srv/fatcat_covid19/metadata/fatcat_hits.2020-04-27.enrich.json \
+ cat /srv/fatcat_covid19/metadata/2020-06-24/fatcat_hits.enrich.json \
| jq -c .fatcat_release \
| rg -v "^null" \
- | parallel -j8 --linebuffer --round-robin --pipe python -m fatcat_scholar.work_pipeline run_releases --fulltext-cache-dir /srv/fatcat_covid19/fulltext_web \
+ | parallel -j8 --linebuffer --round-robin --pipe python -m fatcat_scholar.work_pipeline run_releases \
| pv -l \
| gzip > data/work_intermediate.json.gz
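
For readers unfamiliar with the `jq -c .fatcat_release | rg -v "^null"` stage that this patch leaves unchanged, the following is a rough Python sketch of what it appears to do: read one JSON object per line, pull out the `fatcat_release` field, and skip records where that field is missing or null. The function name `extract_releases` and the `ident` field in the sample are hypothetical and are not taken from the fatcat-scholar code. The sketch assumes every input line is a JSON object.

```python
import json
from typing import Iterable, Iterator


def extract_releases(lines: Iterable[str]) -> Iterator[str]:
    """Yield compact JSON for each non-null `fatcat_release` field.

    Hypothetical helper approximating `jq -c .fatcat_release | rg -v "^null"`.
    """
    for line in lines:
        line = line.strip()
        if not line:
            # Skip blank lines rather than failing on them.
            continue
        record = json.loads(line)
        release = record.get("fatcat_release")
        if release is None:
            # jq prints `null` for a missing or null field; rg -v drops it.
            continue
        # Compact separators mimic the one-line output of `jq -c`.
        yield json.dumps(release, separators=(",", ":"), ensure_ascii=False)


if __name__ == "__main__":
    # Made-up sample records, for illustration only.
    sample = [
        '{"fatcat_release": {"ident": "abc"}, "other": 1}',
        '{"fatcat_release": null}',
        '{"other": 2}',
    ]
    for out in extract_releases(sample):
        print(out)
```

Only the first sample record carries a non-null `fatcat_release`, so only that one is passed on to the next stage of the pipeline.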