diff options
| author | Bryan Newbold <bnewbold@archive.org> | 2020-08-11 17:22:10 -0700 | 
|---|---|---|
| committer | Bryan Newbold <bnewbold@archive.org> | 2020-08-11 17:22:10 -0700 | 
| commit | 7e8ff96fb90ddd1c853418a6c405d97afbc45355 (patch) | |
| tree | 8efd62adf4a92f44bdb71384af955400102b0f34 /python_hadoop | |
| parent | d5f0602e80847adf3d359a7fd06cc131c07cb6dd (diff) | |
| download | sandcrawler-7e8ff96fb90ddd1c853418a6c405d97afbc45355.tar.gz sandcrawler-7e8ff96fb90ddd1c853418a6c405d97afbc45355.zip | |
check for simple URL patterns that are usually paywalls or loginwalls
Diffstat (limited to 'python_hadoop')
0 files changed, 0 insertions, 0 deletions
