path: root/python_hadoop/README.md
author: Bryan Newbold <bnewbold@archive.org> 2020-09-14 14:13:34 -0700
committer: Bryan Newbold <bnewbold@archive.org> 2020-09-14 14:13:34 -0700
commit: ee6129ea884036b666de7cff4ad7891675a52b3c
tree: f3f2d4970f2622b16425eab7ae0de2eacac30ef5 /python_hadoop/README.md
parent: 62252a6179953ccc79a6cb60c40a756fa0a034e1
download: sandcrawler-ee6129ea884036b666de7cff4ad7891675a52b3c.tar.gz
download: sandcrawler-ee6129ea884036b666de7cff4ad7891675a52b3c.zip
ingest: treat text/xml as XHTML in pdf ingest
Diffstat (limited to 'python_hadoop/README.md')
0 files changed, 0 insertions, 0 deletions