author | Bryan Newbold <bnewbold@archive.org> | 2020-11-08 18:02:51 -0800 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2020-11-08 18:02:51 -0800 |
commit | 1a8601bdc36640894d1c34f5c92bc2eda5771bca (patch) | |
tree | 036bfa0cab2f5b06bbe93956a7d8d2b95241d1a9 /scalding/src/main/resources/slug-denylist.txt | |
parent | abe36a83d189e13f3fe20519ccc4d90114e71455 (diff) | |
download | sandcrawler-1a8601bdc36640894d1c34f5c92bc2eda5771bca.tar.gz sandcrawler-1a8601bdc36640894d1c34f5c92bc2eda5771bca.zip |
html: more extraction patterns; bugfix; skip more crossmark
Diffstat (limited to 'scalding/src/main/resources/slug-denylist.txt')
0 files changed, 0 insertions, 0 deletions