aboutsummaryrefslogtreecommitdiffstats
path: root/kafka/grobid_kafka_notes.txt
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2021-09-03 10:37:37 -0700
committerBryan Newbold <bnewbold@archive.org>2021-09-03 10:37:37 -0700
commit2ebef36c083b59d158fae7098da49bf972141f1c (patch)
treec66f18e72312ce5598c8355164a9dfbe241ef5bc /kafka/grobid_kafka_notes.txt
parentd963a61ea3e4bf278fd62047b258722967cd20c9 (diff)
downloadsandcrawler-2ebef36c083b59d158fae7098da49bf972141f1c.tar.gz
sandcrawler-2ebef36c083b59d158fae7098da49bf972141f1c.zip
HTML ingest: several more PDF fulltext URL patterns
Diffstat (limited to 'kafka/grobid_kafka_notes.txt')
0 files changed, 0 insertions, 0 deletions