aboutsummaryrefslogtreecommitdiffstats
path: root/mapreduce/extraction_cdx_grobid.py
diff options
context:
space:
mode:
authorbnewbold <bnewbold@archive.org>2018-08-23 22:55:39 +0000
committerbnewbold <bnewbold@archive.org>2018-08-23 22:55:39 +0000
commitc6e9aa4226aa8ed02c80e829ddb1d3fd40103017 (patch)
tree7cadfce40b8e1873d95609bfeff41181ef5ac308 /mapreduce/extraction_cdx_grobid.py
parent03968da99d24d81e0224712056d1dea38cb8c70e (diff)
parent6b401b34f189475efb84e72dafa2124ac50b5ee8 (diff)
downloadsandcrawler-c6e9aa4226aa8ed02c80e829ddb1d3fd40103017.tar.gz
sandcrawler-c6e9aa4226aa8ed02c80e829ddb1d3fd40103017.zip
Merge branch 'ellen-length-filtering' into 'master'
Filtering titles by length See merge request webgroup/sandcrawler!21
Diffstat (limited to 'mapreduce/extraction_cdx_grobid.py')
0 files changed, 0 insertions, 0 deletions