diff options
author | Bryan Newbold <bnewbold@archive.org> | 2019-04-18 11:14:53 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2019-04-18 11:14:53 -0700 |
commit | e752e0974cdd188fda26b0f829573d78bb2c57ef (patch) | |
tree | b76fe1314da253236e4667423f5d54ea1435da4e /README.md | |
parent | cd566e0b44cfa7cb110b60158aa029189e2d03ff (diff) | |
download | arabesque-e752e0974cdd188fda26b0f829573d78bb2c57ef.tar.gz arabesque-e752e0974cdd188fda26b0f829573d78bb2c57ef.zip |
add postprocess command to README
Diffstat (limited to 'README.md')
-rw-r--r-- | README.md | 1 |
1 files changed, 1 insertions, 0 deletions
@@ -24,6 +24,7 @@ The simplest usage is to specify a seed-url/identifier mapping, a crawl log, and an output database file name: ./arabesque.py everything examples/crawl.log examples/seed_doi.tsv output.sqlite3 + ./arabesque.py postprocess examples/grobid_status_codes.tsv output.sqlite3 Then generate an HTML report: |