<feed xmlns='http://www.w3.org/2005/Atom'>
<title>sandcrawler/pig, branch bnewbold-args</title>
<subtitle>[no description]</subtitle>
<id>https://git.bnewbold.net/sandcrawler/atom?h=bnewbold-args</id>
<link rel='self' href='https://git.bnewbold.net/sandcrawler/atom?h=bnewbold-args'/>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/'/>
<updated>2018-05-08T17:06:20+00:00</updated>
<entry>
<title>fix tests post-DISTINCT</title>
<updated>2018-05-08T17:06:20+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-05-08T17:06:14+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=18a55d37a87d4391bd8161201c523dd7d7f0f1e7'/>
<id>urn:sha1:18a55d37a87d4391bd8161201c523dd7d7f0f1e7</id>
<content type='text'>
Confirms it's working!
</content>
</entry>
<entry>
<title>distinct on SHA1 in cdx scripts</title>
<updated>2018-05-08T16:58:24+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-05-08T16:58:24+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=1831a3b4495aee275e4b4b187fa545eba75eb87b'/>
<id>urn:sha1:1831a3b4495aee275e4b4b187fa545eba75eb87b</id>
<content type='text'>
</content>
</entry>
<entry>
<title>pig cdx join improvements</title>
<updated>2018-05-08T16:58:09+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-05-08T16:58:09+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=64503ce8fb755384623821bfabfa81bbb37d8f6e'/>
<id>urn:sha1:64503ce8fb755384623821bfabfa81bbb37d8f6e</id>
<content type='text'>
</content>
</entry>
<entry>
<title>how to run pig in production</title>
<updated>2018-05-08T06:44:39+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-05-08T06:44:39+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=681b085bc2a090b8db366c54780f1ec81d811403'/>
<id>urn:sha1:681b085bc2a090b8db366c54780f1ec81d811403</id>
<content type='text'>
</content>
</entry>
<entry>
<title>WIP on filter-cdx-join-urls.pig</title>
<updated>2018-05-08T06:41:10+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-05-08T06:41:10+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=2a1c887309305187d785b34a16c1868d26cb3273'/>
<id>urn:sha1:2a1c887309305187d785b34a16c1868d26cb3273</id>
<content type='text'>
</content>
</entry>
<entry>
<title>pig script to filter GWB CDX by SURT regexes</title>
<updated>2018-05-08T05:11:18+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-05-08T05:10:51+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=d1401444dbfb515e62094f873d520a23ccbc29d9'/>
<id>urn:sha1:d1401444dbfb515e62094f873d520a23ccbc29d9</id>
<content type='text'>
</content>
</entry>
<entry>
<title>improve pig helper</title>
<updated>2018-05-08T05:11:18+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-05-08T05:10:18+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=81d2f6290fff487f0f49b109227443c0f8a7aedb'/>
<id>urn:sha1:81d2f6290fff487f0f49b109227443c0f8a7aedb</id>
<content type='text'>
</content>
</entry>
<entry>
<title>try pig env again</title>
<updated>2018-04-06T22:27:27+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-04-06T22:27:27+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=401b09afb5665356a44291af5b42a3cd9836c0fb'/>
<id>urn:sha1:401b09afb5665356a44291af5b42a3cd9836c0fb</id>
<content type='text'>
</content>
</entry>
<entry>
<title>use IA mirror for pig download</title>
<updated>2018-04-06T22:17:32+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-04-06T22:17:32+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=b5b266dc45d64e94bb9d721643d06965b7872963'/>
<id>urn:sha1:b5b266dc45d64e94bb9d721643d06965b7872963</id>
<content type='text'>
</content>
</entry>
<entry>
<title>shift docs around a bit</title>
<updated>2018-04-03T02:25:57+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2018-04-03T02:25:57+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=9d4520e8e18d7bf9b36d98d330417360194e80a3'/>
<id>urn:sha1:9d4520e8e18d7bf9b36d98d330417360194e80a3</id>
<content type='text'>
</content>
</entry>
</feed>
