aboutsummaryrefslogtreecommitdiffstats
path: root/python_hadoop/grobid2json.py
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2020-08-05 13:06:58 -0700
committerBryan Newbold <bnewbold@archive.org>2020-08-05 13:06:58 -0700
commitf4c2800109fe14af19137eac9760026f0efb0c03 (patch)
tree831fb395529382a916b6bdcc4c02be0156574f9b /python_hadoop/grobid2json.py
parentae531a3314742deb1bdd2560ffbcaa2d1f8d829b (diff)
downloadsandcrawler-f4c2800109fe14af19137eac9760026f0efb0c03.tar.gz
sandcrawler-f4c2800109fe14af19137eac9760026f0efb0c03.zip
more bad PDF sha1; print sha1 before poppler extract
Diffstat (limited to 'python_hadoop/grobid2json.py')
0 files changed, 0 insertions, 0 deletions