| author | Bryan Newbold <bnewbold@archive.org> | 2020-02-22 16:23:25 -0800 | 
|---|---|---|
| committer | Bryan Newbold <bnewbold@archive.org> | 2020-02-22 16:23:54 -0800 | 
| commit | fbfcb3cc2215613d972e589eaad519ea726b5d31 (patch) | |
| tree | 6a8aca51339641159e7ed5e32ceac83e86ac59c7 /hbase/schema_design.md | |
| parent | a2a652cefdfa54c7d6bf16dfcf8b1e2e45fb8947 (diff) | |
ia: improve warc/revisit implementation
A lot of the terminal-bad-status results seem to have been due to not
handling revisits correctly. They have status_code = '-' or None.
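
A minimal sketch of the idea in the message above, not the code from this commit: treat a status_code of '-' or None as "no status of its own" instead of as a failure. The function names and the returned labels other than "terminal-bad-status" are hypothetical, chosen only for illustration.

```python
from typing import Optional, Union


def normalize_status_code(raw: Union[str, int, None]) -> Optional[int]:
    """Return an integer HTTP status, or None when the record has no usable status.

    Hypothetical helper: '-' and None are the two values the commit message
    associates with revisit records.
    """
    if raw is None or raw == "-":
        return None
    return int(raw)


def classify_status(raw: Union[str, int, None]) -> str:
    """Classify a capture by status without treating a missing status as bad."""
    code = normalize_status_code(raw)
    if code is None:
        # Possibly a revisit: the original capture would need to be looked up
        # before deciding whether this is a real failure.
        return "no-status"
    if 200 <= code < 300:
        return "success"
    if 300 <= code < 400:
        return "redirect"
    return "terminal-bad-status"
```

With this split, only records carrying an actual non-2xx/3xx status are labelled terminal-bad-status; records with '-' or None are set aside for revisit resolution.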
Diffstat (limited to 'hbase/schema_design.md')
0 files changed, 0 insertions, 0 deletions
