path: root/pig/filter-cdx-pdfs.pig
author: Bryan Newbold <bnewbold@archive.org> 2020-02-12 20:33:31 -0800
committer: Bryan Newbold <bnewbold@archive.org> 2020-02-12 20:33:34 -0800
commit: 4aec6410c2318972240ded2bce5f68706aae18df
tree: 1c723f7ff91205073031a5046ad33dc20da28d02 /pig/filter-cdx-pdfs.pig
parent: f269709baea5d6e95ab101eb8d030ecae9de7e77
pdftrio JSON object as top-level in Kafka results
To be the same as the GROBID results
Diffstat (limited to 'pig/filter-cdx-pdfs.pig')
0 files changed, 0 insertions, 0 deletions