diff options
author | Bryan Newbold <bnewbold@archive.org> | 2020-01-15 21:25:57 -0800 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2020-01-15 21:25:59 -0800 |
commit | e83850ec7bd113e6cfb4af97df37934cb23ef265 (patch) | |
tree | 383ad984d750d1f128e5242bfd06597a0cccee7f /python/ia_pdf_match.py | |
parent | f4862bd582577749c7d71979e3e56650a4a58200 (diff) | |
download | sandcrawler-e83850ec7bd113e6cfb4af97df37934cb23ef265.tar.gz sandcrawler-e83850ec7bd113e6cfb4af97df37934cb23ef265.zip |
wayback replay: catch UnicodeDecodeError
In prod, ran in to a redirect URL like:
b'/web/20200116043630id_/https://mediarep.org/bitstream/handle/doc/1127/Barth\xe9l\xe9my_2015_Life_and_Technology.pdf;jsessionid=A9EFB2798846F5E14A8473BBFD6AB46C?sequence=1'
which broke requests.
Diffstat (limited to 'python/ia_pdf_match.py')
0 files changed, 0 insertions, 0 deletions