Commit message | Author | Age | Files | Lines |
---|---|---|---|---|
merge backfill into extraction directory | Bryan Newbold | 2018-04-04 | 9 | -933/+0 |
trivial whitespace | Bryan Newbold | 2018-04-04 | 2 | -1/+2 |
more TODO | Bryan Newbold | 2018-04-04 | 1 | -0/+5 |
actually running hadoop job on cluster | Bryan Newbold | 2018-04-03 | 2 | -0/+18 |
fix silly bugs in backfiller (need more tests) | Bryan Newbold | 2018-04-03 | 1 | -3/+4 |
add setuptools (can probably remove) | Bryan Newbold | 2018-04-03 | 2 | -7/+8 |
heritrix expects ints, not strings, for numbers | Bryan Newbold | 2018-04-02 | 1 | -7/+7 |
backfill: sha1 prefix, cluster example | Bryan Newbold | 2018-03-30 | 3 | -8/+19 |
clean up backfill code/tests | Bryan Newbold | 2018-03-30 | 2 | -24/+42 |
refactor backfill for mrjob | Bryan Newbold | 2018-03-30 | 4 | -64/+145 |
pytest helpers | Bryan Newbold | 2018-03-30 | 4 | -32/+564 |
renames | Bryan Newbold | 2018-03-30 | 3 | -0/+265 |