aboutsummaryrefslogtreecommitdiffstats
path: root/pig/filter-cdx-pdfs.pig
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2020-03-02 16:37:08 -0800
committerBryan Newbold <bnewbold@archive.org>2020-03-02 16:37:08 -0800
commitb45e1ac6638edb9d634269a343d05eff90daa31e (patch)
tree0c9e6bcedec7c782e2bbd54347a4c614077fd22f /pig/filter-cdx-pdfs.pig
parent6d41261ac417c61a61d0c794fa07639f454bcd52 (diff)
downloadsandcrawler-b45e1ac6638edb9d634269a343d05eff90daa31e.tar.gz
sandcrawler-b45e1ac6638edb9d634269a343d05eff90daa31e.zip
ingest: add force_recrawl flag to skip historical wayback lookup
Diffstat (limited to 'pig/filter-cdx-pdfs.pig')
0 files changed, 0 insertions, 0 deletions