diff options
author | Bryan Newbold <bnewbold@archive.org> | 2020-11-08 13:20:58 -0800 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2020-11-08 13:20:58 -0800 |
commit | 00ed69dd00d07344d62c5adad4e9d15c721c3bb1 (patch) | |
tree | 27f0298d6edd76a99686049470faeed1f54a0c69 /python_hadoop/tests/test_extraction_ungrobided.py | |
parent | 3977afdff906367525cd6959221b6f3edf19793d (diff) | |
download | sandcrawler-00ed69dd00d07344d62c5adad4e9d15c721c3bb1.tar.gz sandcrawler-00ed69dd00d07344d62c5adad4e9d15c721c3bb1.zip |
html: try to detect and mark XHTML (vs. HTML or XML)
Diffstat (limited to 'python_hadoop/tests/test_extraction_ungrobided.py')
0 files changed, 0 insertions, 0 deletions