aboutsummaryrefslogtreecommitdiffstats
path: root/scalding/src/main/scala/sandcrawler/CdxBackfillJob.scala
Commit message (Collapse)AuthorAgeFilesLines
* small CdxBackfillJob refactor (code quality)Bryan Newbold2018-07-241-5/+5
|
* do sha1 pattern match correctlyBryan Newbold2018-07-241-3/+7
|
* more PDF mimetypes; fix return refactorBryan Newbold2018-07-241-2/+5
|
* CdxBackfillJob: comment cleanupBryan Newbold2018-07-241-6/+0
|
* CdxBackfillJob: scalastyleBryan Newbold2018-07-241-22/+14
|
* address some (but not all) review commentsBryan Newbold2018-07-241-20/+21
|
* fix CdxBackfillJob testsBryan Newbold2018-07-241-4/+11
|
* some scalastyle on CdxBackfillJobBryan Newbold2018-07-241-7/+8
|
* CdxBackfillJob: implement other fieldsBryan Newbold2018-07-241-10/+24
|
* CdxBackfillJob back to HBase; tests workBryan Newbold2018-07-241-7/+5
|
* variant of CdxBackfillJob that writes to TSVBryan Newbold2018-07-241-0/+173
Has the same test failure ("java.lang.IndexOutOfBoundsException: Index: 1, Size: 1")