diff options
| author | Bryan Newbold <bnewbold@archive.org> | 2021-11-04 11:49:33 -0700 | 
|---|---|---|
| committer | Bryan Newbold <bnewbold@archive.org> | 2021-11-04 17:19:52 -0700 | 
| commit | dd8cdc88f71e6a395ab5b10d84d6443f70e39048 (patch) | |
| tree | c97e096d81c49d8ac1f4853b565b5a527960a0f3 /pig/filter-cdx-pdfs.pig | |
| parent | 34b3415433c65dfb41746a3a335e7217c7d1144e (diff) | |
| download | sandcrawler-dd8cdc88f71e6a395ab5b10d84d6443f70e39048.tar.gz sandcrawler-dd8cdc88f71e6a395ab5b10d84d6443f70e39048.zip | |
crossref grobid refs: another error case (ReadTimeout)
With this last exception handled, was about to get through millions of
rows of references, with only a few dozen errors (mostly invalid XML).
Diffstat (limited to 'pig/filter-cdx-pdfs.pig')
0 files changed, 0 insertions, 0 deletions
