aboutsummaryrefslogtreecommitdiffstats
path: root/python
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2021-11-16 00:24:13 -0800
committerBryan Newbold <bnewbold@archive.org>2021-11-16 00:24:15 -0800
commite323b390bf6790edfd8a4d0683642a480e62c743 (patch)
treeb587248bccfa3bffc122bc53dc2412bb15d2f636 /python
parentbd8b0b4d0d1fb04e34002dd3da91d83b922d0991 (diff)
downloadsandcrawler-e323b390bf6790edfd8a4d0683642a480e62c743.tar.gz
sandcrawler-e323b390bf6790edfd8a4d0683642a480e62c743.zip
SPNv2: make 'resources' optional
This was always present previously. A change was made to SPNv2 API recently that borked it a bit, though in theory should be present on new captures. I'm not seeing it for some captures, so pushing this work around. It seems like we don't actually use this field anyways, at least for ingest pipeline.
Diffstat (limited to 'python')
-rw-r--r--python/sandcrawler/ia.py2
1 files changed, 1 insertions, 1 deletions
diff --git a/python/sandcrawler/ia.py b/python/sandcrawler/ia.py
index d334d24..dc9aae5 100644
--- a/python/sandcrawler/ia.py
+++ b/python/sandcrawler/ia.py
@@ -1101,7 +1101,7 @@ class SavePageNowClient:
request_url,
final_json["original_url"],
final_json["timestamp"],
- final_json["resources"],
+ final_json.get("resources") or None,
)
else:
if final_json["status"] == "pending":