aboutsummaryrefslogtreecommitdiffstats
path: root/python_hadoop/tests/test_backfill_hbase_from_cdx.py
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2020-03-17 16:33:16 -0700
committerBryan Newbold <bnewbold@archive.org>2020-03-17 16:36:45 -0700
commitb23469d5b978b3b42a0aa55e7a191280fe1beccd (patch)
tree6b053293df5a60e3000ad975168a52804e66d7d1 /python_hadoop/tests/test_backfill_hbase_from_cdx.py
parent30ba490bb65d195b14f5b06aea2de5b4eb1d23d2 (diff)
downloadsandcrawler-b23469d5b978b3b42a0aa55e7a191280fe1beccd.tar.gz
sandcrawler-b23469d5b978b3b42a0aa55e7a191280fe1beccd.zip
work around local redirect (resource.location)
Some redirects are host-local. This patch crudely detects this (full-path redirects starting with "/" only), and appends the URL to the host of the original URL.
Diffstat (limited to 'python_hadoop/tests/test_backfill_hbase_from_cdx.py')
0 files changed, 0 insertions, 0 deletions