summaryrefslogtreecommitdiffstats
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2020-07-01 15:04:29 -0700
committerBryan Newbold <bnewbold@archive.org>2020-07-01 15:04:29 -0700
commit5819da753c7b2c5efcd05b8c441bb3282ad30a0f (patch)
treeeddce812d536513cf511824dbf88000db0c70e33
parent2dea232afd02dc49ccbf114ab7fa1edf1b69e92d (diff)
downloadfatcat-scholar-5819da753c7b2c5efcd05b8c441bb3282ad30a0f.tar.gz
fatcat-scholar-5819da753c7b2c5efcd05b8c441bb3282ad30a0f.zip
update README instructions for issue_db generation
-rw-r--r--README.md5
1 files changed, 3 insertions, 2 deletions
diff --git a/README.md b/README.md
index 02ddf7d..79a9dff 100644
--- a/README.md
+++ b/README.md
@@ -39,8 +39,9 @@ Generate complete SIM issue database:
cat data/sim_collections.tsv | parallel -j4 ia metadata {} | jq . -c | pv -l > data/sim_collections.json
cat data/sim_items.tsv | parallel -j8 ia metadata {} | jq . -c | pv -l > data/sim_items.json
- cat data/sim_collections.2020-05-15.json | pv -l | python -m fatcat_scholar.issue_db load_pubs
- cat data/sim_items.2020-05-15.json | pv -l | python -m fatcat_scholar.issue_db load_issues
+ python -m fatcat_scholar.issue_db init_db
+ cat data/sim_collections.json | pv -l | python -m fatcat_scholar.issue_db load_pubs
+ cat data/sim_items.json | pv -l | python -m fatcat_scholar.issue_db load_issues
python -m fatcat_scholar.issue_db load_counts
Create QA elasticsearch index (localhost):