author | Bryan Newbold <bnewbold@archive.org> | 2020-08-05 13:06:08 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2020-08-05 13:06:10 -0700 |
commit | ae531a3314742deb1bdd2560ffbcaa2d1f8d829b | |
tree | 7237acdce27b7c42690731aa01f9675a88067085 /pig/filter-cdx-paper-pdfs.pig | |
parent | 576b52831d9f17adaee9839db20b4145ba141d96 | |
spn2: skip js behavior (experiment)

Hoping this will increase crawling throughput with little-to-no impact
on fidelity.
Diffstat (limited to 'pig/filter-cdx-paper-pdfs.pig')
0 files changed, 0 insertions, 0 deletions