diff options
author | Bryan Newbold <bnewbold@archive.org> | 2020-11-08 18:03:54 -0800 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2020-11-08 18:03:54 -0800 |
commit | a8ff73617a16a8b8b524c454247bde2399f34bf1 (patch) | |
tree | 287804f91071d57aed7bc1a223080a2f3f653354 /notes/url_pattern_heuristic_verification.txt | |
parent | 1a8601bdc36640894d1c34f5c92bc2eda5771bca (diff) | |
download | sandcrawler-a8ff73617a16a8b8b524c454247bde2399f34bf1.tar.gz sandcrawler-a8ff73617a16a8b8b524c454247bde2399f34bf1.zip |
html: more robust ingest; better platform and scope detection
Diffstat (limited to 'notes/url_pattern_heuristic_verification.txt')
0 files changed, 0 insertions, 0 deletions