path: root/scalding
Commit message  Author  Age  Files  Lines
* Added job and test for counting mime types.  Ellen Spertus  2018-06-06  2  -0/+96
|
* Made package names match directory names. Cleaned up imports.  Ellen Spertus  2018-06-05  4  -16/+13
|
* Merge branch 'refactoring' into 'master'  bnewbold  2018-06-04  4  -20/+101
|\
| |   Refactoring to add, use, and test class HBaseBuilder to eliminate duplicated code and facilitate HBaseSource creation
| |
| |   See merge request webgroup/sandcrawler!1
| * Made changes suggested in merge request review.  Ellen Spertus  2018-06-04  3  -15/+10
| |
| |   - Changed inverseSchema from Map to List, eliminating incorrect comment.
| |   - Changing format of argument to HBaseBuilder.build from String to List[String].
| * Changed interface to HBaseBuilder.parseColSpec.  Ellen Spertus  2018-06-03  3  -8/+12
| |
| * Added HBaseBuilder.build() and had HBaseRowCountJob call it.  Ellen Spertus  2018-06-03  2  -11/+5
| |
| * Added HBaseBuilder.parseColSpecs and tests, which pass.  Ellen Spertus  2018-06-03  2  -0/+92
| |
| * Factored common code out of HBaseRowCountJob and its test into a new companion object.  Ellen Spertus  2018-06-01  2  -16/+12
| |
* | fetch SpyGlass jar from archive.org (not local)  Bryan Newbold  2018-06-04  2  -19/+7
|/
* Provided full path to cascading jar in command line.  Ellen Spertus  2018-05-31  1  -1/+1
|
* Added tip on OutOfMemoryError.  Ellen Spertus  2018-05-31  1  -1/+5
|
* Added debugging info for cascading.tuple.Fields.  Ellen Spertus  2018-05-31  1  -1/+23
|
* switch HBaseRowCountJob to SCAN_ALL  Bryan Newbold  2018-05-29  2  -5/+11
|
* HBaseRowCountJob actually counts rows  Bryan Newbold  2018-05-29  2  -13/+8
|
* update version and project name  Bryan Newbold  2018-05-24  3  -4/+6
|
* cleanup scalding notes/README  Bryan Newbold  2018-05-24  3  -37/+162
|
* assemblyMergeStrategy deprecation warning  Bryan Newbold  2018-05-24  1  -2/+2
|
* rename jvm/scalding directories  Bryan Newbold  2018-05-24  12  -0/+365