diff options
author | Bryan Newbold <bnewbold@archive.org> | 2020-11-03 23:37:50 -0800 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2020-11-04 09:04:47 -0800 |
commit | a87ca1de1d8b31c4fbf9fddead27cdc58b09565a (patch) | |
tree | 575868d7d9c0d3d28a37d288d5e4975e57c8eaab /notes/url_pattern_heuristic_verification.txt | |
parent | 8f964b9b48572ac71f27ba64207816dfd3a6dc36 (diff) | |
download | sandcrawler-a87ca1de1d8b31c4fbf9fddead27cdc58b09565a.tar.gz sandcrawler-a87ca1de1d8b31c4fbf9fddead27cdc58b09565a.zip |
initial implementation of HTML ingest in existing worker
Diffstat (limited to 'notes/url_pattern_heuristic_verification.txt')
0 files changed, 0 insertions, 0 deletions