aboutsummaryrefslogtreecommitdiffstats
path: root/python/sandcrawler/html.py
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2022-07-20 18:03:12 -0700
committerBryan Newbold <bnewbold@archive.org>2022-07-20 18:03:12 -0700
commit98b95dea4eafec78f16f6afbabfe65aa2489e78f (patch)
tree2aa4ff3337a4315dbdf5cbf84b086cc14235dc8c /python/sandcrawler/html.py
parenta72019e6e788be64420719c5045e40614098c106 (diff)
downloadsandcrawler-98b95dea4eafec78f16f6afbabfe65aa2489e78f.tar.gz
sandcrawler-98b95dea4eafec78f16f6afbabfe65aa2489e78f.zip
ingest: more PDF fulltext URL patterns
Diffstat (limited to 'python/sandcrawler/html.py')
0 files changed, 0 insertions, 0 deletions