aboutsummaryrefslogtreecommitdiffstats
path: root/mapreduce/tests/test_backfill_hbase_from_cdx.py
diff options
context:
space:
mode:
authorbnewbold <bnewbold@archive.org>2018-06-07 20:02:42 +0000
committerbnewbold <bnewbold@archive.org>2018-06-07 20:02:42 +0000
commit410c48faf2099de74292e8583fcd2524d6fd1b7c (patch)
tree9684b67ee0b2ac8e7ca3a047fc8368477faf19f5 /mapreduce/tests/test_backfill_hbase_from_cdx.py
parent625ef34f957f7f5fdad99c6ce9d84cf7891fbdef (diff)
parent6eca6290aa3fc829f4767023ae075350a0a78192 (diff)
downloadsandcrawler-410c48faf2099de74292e8583fcd2524d6fd1b7c.tar.gz
sandcrawler-410c48faf2099de74292e8583fcd2524d6fd1b7c.zip
Merge branch 'groupby' into 'master'
Added HBaseMimeCount{Job,Test}, which counts the number of rows with each mimetype, as an example of groupby See merge request webgroup/sandcrawler!4
Diffstat (limited to 'mapreduce/tests/test_backfill_hbase_from_cdx.py')
0 files changed, 0 insertions, 0 deletions