diff options
author | Bryan Newbold <bnewbold@archive.org> | 2020-11-03 16:15:15 -0800 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2020-11-03 16:15:15 -0800 |
commit | 5d45a76e6c2c2ba530484c578db5e726c685eba8 (patch) | |
tree | d44894af0fd6eae44bc003544e2ee2d44ccf4269 /python_hadoop/grobid2json.py | |
parent | a1a4e96e44bfb851003e578defd6f33008be6871 (diff) | |
download | sandcrawler-5d45a76e6c2c2ba530484c578db5e726c685eba8.tar.gz sandcrawler-5d45a76e6c2c2ba530484c578db5e726c685eba8.zip |
ingest: cleanups, typing, start generalizing to xml and html
Diffstat (limited to 'python_hadoop/grobid2json.py')
0 files changed, 0 insertions, 0 deletions