author | Bryan Newbold <bnewbold@archive.org> | 2018-08-20 18:43:11 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2018-08-21 21:25:56 -0700 |
commit | 29eb22bc60193e6f919d98ec14a39ecd53477331 (patch) | |
tree | fd3c99ce853bc5036dc2572eb04c1d3b1f6b8a38 /mapreduce/backfill_hbase_from_cdx.py | |
parent | 39bf4b57cd552e8042bfa25565b390cb2a456ab0 (diff) | |
download | sandcrawler-29eb22bc60193e6f919d98ec14a39ecd53477331.tar.gz sandcrawler-29eb22bc60193e6f919d98ec14a39ecd53477331.zip |
use grobid0:metadata, not tei_json

This is for efficiency. I had forgotten that the extract script actually
writes this path!
Diffstat (limited to 'mapreduce/backfill_hbase_from_cdx.py')
0 files changed, 0 insertions, 0 deletions