aboutsummaryrefslogtreecommitdiffstats
path: root/backfill/backfill_hbase_from_cdx.py
Commit message (Expand)AuthorAgeFilesLines
* merge backfill into extraction directoryBryan Newbold2018-04-041-195/+0
* trivial whitespaceBryan Newbold2018-04-041-1/+1
* fix silly bugs in backfiller (need more tests)Bryan Newbold2018-04-031-3/+4
* heritrix expects ints, not strings, for numbersBryan Newbold2018-04-021-7/+7
* backfill: sha1 prefix, cluster exampleBryan Newbold2018-03-301-2/+6
* clean up backfill code/testsBryan Newbold2018-03-301-6/+29
* refactor backfill for mrjobBryan Newbold2018-03-301-64/+62
* pytest helpersBryan Newbold2018-03-301-0/+169