aboutsummaryrefslogtreecommitdiffstats
path: root/python_hadoop/common.py
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2021-11-03 19:21:54 -0700
committerBryan Newbold <bnewbold@archive.org>2021-11-03 19:21:57 -0700
commit891299fd461b17c60fb48364cd5dca08c0711c32 (patch)
treec079448b1e5bfbd5d0acdf2baadc52d05b91352d /python_hadoop/common.py
parent848556a64d13955c2978bad352f2e2cd9edb62d0 (diff)
downloadsandcrawler-891299fd461b17c60fb48364cd5dca08c0711c32.tar.gz
sandcrawler-891299fd461b17c60fb48364cd5dca08c0711c32.zip
IA (wayback): actually use an HTTP session for replay fetches
I am embarassed this wasn't actually the case already! Looks like I had even instantiated a session but wasn't using it. Hopefully this change, which adds extra retries and better backoff behavior, will improve sandcrawler ingest throughput.
Diffstat (limited to 'python_hadoop/common.py')
0 files changed, 0 insertions, 0 deletions