diff options
author | Bryan Newbold <bnewbold@archive.org> | 2020-01-14 16:12:29 -0800 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2020-01-14 16:12:29 -0800 |
commit | 2bf0095335203d200370e23922a6ff38ac98201c (patch) | |
tree | 312240cbb069f681a7544775f0d49d903f31239f /pig/filter-cdx-join-urls.pig | |
parent | 29d53a3b8cd27cb7a40ca9588a85ccb49dd98352 (diff) | |
download | sandcrawler-2bf0095335203d200370e23922a6ff38ac98201c.tar.gz sandcrawler-2bf0095335203d200370e23922a6ff38ac98201c.zip |
filter out archive.org and web.archive.org (until implemented)
Diffstat (limited to 'pig/filter-cdx-join-urls.pig')
0 files changed, 0 insertions, 0 deletions