author | Bryan Newbold <bnewbold@archive.org> | 2020-06-25 23:08:47 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2020-06-25 23:08:47 -0700 |
commit | 254f24ad6566c9d4b5814868911b604802847b58 (patch) | |
tree | 20e9df75d4b89a29d658f57821899044cab34956 /pig/filter-cdx-join-urls.pig | |
parent | fa25698620f54207423c3da40ae1bca567b598fa (diff) | |
simpler handling of null PDF text pages
Diffstat (limited to 'pig/filter-cdx-join-urls.pig')
0 files changed, 0 insertions, 0 deletions