author: Bryan Newbold <bnewbold@archive.org> 2020-08-11 17:40:56 -0700
committer: Bryan Newbold <bnewbold@archive.org> 2020-08-11 17:40:59 -0700
commit: 92bf9bc28ac0eacab2e06fa3b25b52f0882804c2
tree: 6db86c3a90c76a7027ea787375dcad131e5470da /python_hadoop/grobid2json.py
parent: 644e412c38c8897e171e3aa1244f1aa6955d8e65
ingest: reduce CDX retry_sleep to 3.0 sec (after SPN)
As we are moving towards just retrying entire ingest requests, we should probably just make this zero. But until then, we should give SPN CDX a small chance to sync before giving up. This change is expected to improve overall throughput.
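
The retry-then-sleep behavior described above can be sketched roughly as follows. This is a minimal illustration, not the sandcrawler implementation: the function name `lookup_with_retry`, the `lookup` callable, the `retries` count, and the injectable `sleep` parameter are all hypothetical. Only the `retry_sleep` name and its 3.0 second value come from the commit message.

```python
import time
from typing import Callable, Optional, TypeVar

T = TypeVar("T")


def lookup_with_retry(
    lookup: Callable[[], Optional[T]],
    retries: int = 1,
    retry_sleep: float = 3.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[T]:
    """Call a (hypothetical) CDX lookup, sleeping between failed attempts.

    `lookup` returns None when the capture is not yet visible in CDX.
    With retry_sleep=0 the retries happen back to back, which matches
    the "just make this zero" direction mentioned in the commit message.
    """
    for attempt in range(retries + 1):
        result = lookup()
        if result is not None:
            return result
        # Only sleep if another attempt will follow.
        if attempt < retries and retry_sleep > 0:
            sleep(retry_sleep)
    return None
```

A shorter `retry_sleep` lowers the worst-case time spent waiting per request, which is where the expected throughput gain would come from under these assumptions.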
Diffstat (limited to 'python_hadoop/grobid2json.py')
0 files changed, 0 insertions, 0 deletions