<feed xmlns='http://www.w3.org/2005/Atom'>
<title>sandcrawler/backfill, branch trawler</title>
<subtitle>[no description]</subtitle>
<id>https://git.bnewbold.net/sandcrawler/atom?h=trawler</id>
<link rel='self' href='https://git.bnewbold.net/sandcrawler/atom?h=trawler'/>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/'/>
<updated>2018-04-04T18:55:22+00:00</updated>
<entry>
<title>merge backfill into extraction directory</title>
<updated>2018-04-04T18:55:22+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-04-04T18:55:22+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=427dd875958c8a6d2d791d55f9dda300ebdc853b'/>
<id>urn:sha1:427dd875958c8a6d2d791d55f9dda300ebdc853b</id>
<content type='text'>
</content>
</entry>
<entry>
<title>trivial whitespace</title>
<updated>2018-04-04T18:48:48+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-04-04T18:48:48+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=b948cddc7fe7000fd119af4fa130b9c24da46472'/>
<id>urn:sha1:b948cddc7fe7000fd119af4fa130b9c24da46472</id>
<content type='text'>
</content>
</entry>
<entry>
<title>more TODO</title>
<updated>2018-04-04T18:48:00+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-04-04T18:48:00+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=9186783c632b63e7d7c3cd8168139718fba378e9'/>
<id>urn:sha1:9186783c632b63e7d7c3cd8168139718fba378e9</id>
<content type='text'>
</content>
</entry>
<entry>
<title>actually running hadoop job on cluster</title>
<updated>2018-04-03T02:25:28+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-04-03T02:25:28+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=ca2d1bd7eccd619173e1d326c9a2ebb1a2d3d502'/>
<id>urn:sha1:ca2d1bd7eccd619173e1d326c9a2ebb1a2d3d502</id>
<content type='text'>
</content>
</entry>
<entry>
<title>fix silly bugs in backfiller (need more tests)</title>
<updated>2018-04-03T02:25:03+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-04-03T02:25:03+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=630e956c05604aaf8bf5b7154a01ad956b13e440'/>
<id>urn:sha1:630e956c05604aaf8bf5b7154a01ad956b13e440</id>
<content type='text'>
</content>
</entry>
<entry>
<title>add setuptools (can probably remove)</title>
<updated>2018-04-03T02:23:58+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-04-03T02:23:58+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=d8f3c2ffad0b685db6c3196ac3efe846c019f6d7'/>
<id>urn:sha1:d8f3c2ffad0b685db6c3196ac3efe846c019f6d7</id>
<content type='text'>
</content>
</entry>
<entry>
<title>heritrix expects ints, not strings, for numbers</title>
<updated>2018-04-02T21:50:05+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-04-02T21:50:05+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=f1c4935b2d55f3acecd824ba9b6318a33820fafe'/>
<id>urn:sha1:f1c4935b2d55f3acecd824ba9b6318a33820fafe</id>
<content type='text'>
</content>
</entry>
<entry>
<title>backfill: sha1 prefix, cluster example</title>
<updated>2018-03-31T05:53:03+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-03-31T05:53:03+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=31d5a1ebdfe2f4638ae1e5ec87ff228eef9114f5'/>
<id>urn:sha1:31d5a1ebdfe2f4638ae1e5ec87ff228eef9114f5</id>
<content type='text'>
</content>
</entry>
<entry>
<title>clean up backfill code/tests</title>
<updated>2018-03-31T02:12:31+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-03-31T02:12:31+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=bb38ea065758a719331803b4adf875f2d75a1c3d'/>
<id>urn:sha1:bb38ea065758a719331803b4adf875f2d75a1c3d</id>
<content type='text'>
</content>
</entry>
<entry>
<title>refactor backfill for mrjob</title>
<updated>2018-03-31T01:29:29+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-03-31T01:29:29+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=f6915b4b44e312cee7eda9626d0330268ab786e2'/>
<id>urn:sha1:f6915b4b44e312cee7eda9626d0330268ab786e2</id>
<content type='text'>
</content>
</entry>
</feed>
