aboutsummaryrefslogtreecommitdiffstats
path: root/backfill/backfill_hbase_from_cdx.py
Commit message (Collapse)AuthorAgeFilesLines
* merge backfill into extraction directoryBryan Newbold2018-04-041-195/+0
|
* trivial whitespaceBryan Newbold2018-04-041-1/+1
|
* fix silly bugs in backfiller (need more tests)Bryan Newbold2018-04-031-3/+4
|
* heritrix expects ints, not strings, for numbersBryan Newbold2018-04-021-7/+7
|
* backfill: sha1 prefix, cluster exampleBryan Newbold2018-03-301-2/+6
|
* clean up backfill code/testsBryan Newbold2018-03-301-6/+29
|
* refactor backfill for mrjobBryan Newbold2018-03-301-64/+62
|
* pytest helpersBryan Newbold2018-03-301-0/+169