author | Bryan Newbold <bnewbold@archive.org> | 2020-06-29 15:21:34 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2020-06-29 15:21:37 -0700 |
commit | 2f4b35f29f53b0e643c3e7cd74e63370758dc490 (patch) | |
tree | 685962293debf2d91326cbef281f1d3cb717ef4e /python/tests/test_grobid.py | |
parent | 800860ecd25346ff4a638e9d42fa905396b8fa1b (diff) | |
hack to unblock thumbnail processing pipeline
Some PDFs are taking 10+ minutes to process, causing Kafka exceptions and
consumer churn. Not sure why the Kafka JSON pusher timeouts are not catching
these.
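
For illustration only, here is a minimal sketch of one way to put a hard per-record time limit around a slow processing step. It is not taken from this commit, whose diff is not shown here. The names `ProcessingTimeout` and `process_with_timeout` are hypothetical, and the approach assumes a Unix host and a worker running in the main thread, since it relies on `SIGALRM`.

```python
import signal
import time


class ProcessingTimeout(Exception):
    """Raised when a single record exceeds its time budget (hypothetical name)."""


def _raise_timeout(signum, frame):
    # Signal handler: turn the alarm into an exception in the main thread.
    raise ProcessingTimeout()


def process_with_timeout(process_fn, record, timeout_sec):
    """Run process_fn(record), giving up after timeout_sec seconds.

    Unix-only and main-thread-only, because it uses SIGALRM.
    Returns a (result, timed_out) tuple.
    """
    previous = signal.signal(signal.SIGALRM, _raise_timeout)
    signal.setitimer(signal.ITIMER_REAL, timeout_sec)
    try:
        return process_fn(record), False
    except ProcessingTimeout:
        return None, True
    finally:
        # Always cancel the timer and restore the previous handler.
        signal.setitimer(signal.ITIMER_REAL, 0)
        signal.signal(signal.SIGALRM, previous)


if __name__ == "__main__":
    # A stand-in for a slow PDF: sleeps far longer than the budget allows.
    print(process_with_timeout(lambda r: time.sleep(5), None, 0.2))
```

A caveat on this design: Python only runs signal handlers between bytecode instructions, so a call stuck inside native code that never returns to the interpreter may not be interrupted. Running the step in a separate process and terminating it is the heavier but more robust alternative.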
Diffstat (limited to 'python/tests/test_grobid.py')
0 files changed, 0 insertions, 0 deletions