| field | value | date |
|---|---|---|
| author | Bryan Newbold <bnewbold@archive.org> | 2020-01-10 17:19:32 -0800 |
| committer | Bryan Newbold <bnewbold@archive.org> | 2020-01-10 17:19:32 -0800 |
| commit | 5b7f613f77c5bc77f071bcb7cc975c5f4dd02c87 | |
| tree | 8b26a0df650011f8822dc3e992fb1ab40f6ff5bc /python_hadoop/extraction_cdx_grobid.py | |
| parent | f916655ab949ee11b3aa6bc84bb3b2118b0748d0 | |
hack/workaround for protocols.io octet PDFs
Diffstat (limited to 'python_hadoop/extraction_cdx_grobid.py')
0 files changed, 0 insertions, 0 deletions
