path: root/pig/filter-cdx-paper-pdfs.pig
author Bryan Newbold <bnewbold@archive.org> 2020-08-05 13:06:08 -0700
committer Bryan Newbold <bnewbold@archive.org> 2020-08-05 13:06:10 -0700
commit ae531a3314742deb1bdd2560ffbcaa2d1f8d829b
tree 7237acdce27b7c42690731aa01f9675a88067085 /pig/filter-cdx-paper-pdfs.pig
parent 576b52831d9f17adaee9839db20b4145ba141d96
download sandcrawler-ae531a3314742deb1bdd2560ffbcaa2d1f8d829b.tar.gz
download sandcrawler-ae531a3314742deb1bdd2560ffbcaa2d1f8d829b.zip
spn2: skip js behavior (experiment)
Hoping this will increase crawling throughput with little-to-no impact on fidelity.
Diffstat (limited to 'pig/filter-cdx-paper-pdfs.pig')
0 files changed, 0 insertions, 0 deletions