diff options
author | Bryan Newbold <bnewbold@robocracy.org> | 2018-09-28 18:01:21 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@robocracy.org> | 2018-10-12 15:33:19 -0400 |
commit | 619410dd1cf19b1d4dc9b00b5b0c31e253264f8e (patch) | |
tree | 6fddfb6c09eef563f5d87e1646a696fb95f4a7a6 /python/README_import.md | |
parent | f18d985df6dbe1b96d0fa0df06a57a165982531a (diff) | |
download | fatcat-619410dd1cf19b1d4dc9b00b5b0c31e253264f8e.tar.gz fatcat-619410dd1cf19b1d4dc9b00b5b0c31e253264f8e.zip |
update README_import with GROBID command
Diffstat (limited to 'python/README_import.md')
-rw-r--r-- | python/README_import.md | 2 |
1 files changed, 2 insertions, 0 deletions
diff --git a/python/README_import.md b/python/README_import.md index 38064a97..d3bbaddd 100644 --- a/python/README_import.md +++ b/python/README_import.md @@ -61,3 +61,5 @@ Unknown speed! # ... but do on the second zcat /srv/fatcat/datasets/2018-08-27-2352.17-matchcrossref.insertable.json.gz | pv -l | time parallel -j12 --round-robin --pipe ./fatcat_import.py import-matched - + # GROBID extracted (release+file) + time zcat /srv/fatcat/datasets/2018-09-23-0405.30-dumpgrobidmetainsertable.longtail_join.filtered.tsv.gz | pv -l | time parallel -j12 --round-robin --pipe ./fatcat_import.py import-grobid-metadata - |