author | bnewbold <bnewbold@archive.org> | 2018-06-07 20:02:42 +0000 |
---|---|---|
committer | bnewbold <bnewbold@archive.org> | 2018-06-07 20:02:42 +0000 |
commit | 410c48faf2099de74292e8583fcd2524d6fd1b7c | |
tree | 9684b67ee0b2ac8e7ca3a047fc8368477faf19f5 /cdx-record-pipeline/README.md | |
parent | 625ef34f957f7f5fdad99c6ce9d84cf7891fbdef | |
parent | 6eca6290aa3fc829f4767023ae075350a0a78192 | |
Merge branch 'groupby' into 'master'
Added HBaseMimeCount{Job,Test}, which counts the number of rows with each mimetype, as an example of groupby
See merge request webgroup/sandcrawler!4
Diffstat (limited to 'cdx-record-pipeline/README.md')
0 files changed, 0 insertions, 0 deletions