<feed xmlns='http://www.w3.org/2005/Atom'>
<title>sandcrawler/python/scripts, branch trawler</title>
<subtitle>[no description]</subtitle>
<id>https://git.bnewbold.net/sandcrawler/atom?h=trawler</id>
<link rel='self' href='https://git.bnewbold.net/sandcrawler/atom?h=trawler'/>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/'/>
<updated>2021-11-30T23:29:41+00:00</updated>
<entry>
<title>add CDX sha1hex lookup/fetch helper script</title>
<updated>2021-11-30T23:29:41+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-11-30T23:29:41+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=0328598e3b643edd0a2033ca97c607f596dfb092'/>
<id>urn:sha1:0328598e3b643edd0a2033ca97c607f596dfb092</id>
<content type='text'>
</content>
</entry>
<entry>
<title>remove grobid2json helper file, replace with grobid_tei_xml</title>
<updated>2021-10-28T02:10:35+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-28T02:10:35+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=a0e275a4bad46ef41585f0207d6dfa1e3c38bc35'/>
<id>urn:sha1:a0e275a4bad46ef41585f0207d6dfa1e3c38bc35</id>
<content type='text'>
</content>
</entry>
<entry>
<title>make fmt (black 21.9b0)</title>
<updated>2021-10-28T01:50:17+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-28T01:50:17+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=826c7538e091fac14d987a3cd654975da964e240'/>
<id>urn:sha1:826c7538e091fac14d987a3cd654975da964e240</id>
<content type='text'>
</content>
</entry>
<entry>
<title>make fmt</title>
<updated>2021-10-26T19:54:37+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-26T19:54:37+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=05bd7cbcc62588e431c5efd533189e246b2a997e'/>
<id>urn:sha1:05bd7cbcc62588e431c5efd533189e246b2a997e</id>
<content type='text'>
</content>
</entry>
<entry>
<title>python: isort all imports</title>
<updated>2021-10-26T19:22:38+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-26T19:22:38+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=3cdf4af9be4c762ff2ed79a57b5ad30637909f1e'/>
<id>urn:sha1:3cdf4af9be4c762ff2ed79a57b5ad30637909f1e</id>
<content type='text'>
</content>
</entry>
<entry>
<title>scripts: example archiveorg-to-fileset importer</title>
<updated>2021-10-16T01:15:20+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-04T19:54:35+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=452475df7619f3743eac5ad86e2e1fb8ba9972da'/>
<id>urn:sha1:452475df7619f3743eac5ad86e2e1fb8ba9972da</id>
<content type='text'>
</content>
</entry>
<entry>
<title>cdx_collection.py: minor lint issue</title>
<updated>2021-10-04T20:02:08+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-09-14T02:33:31+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=96033132be8976f0c9483a18dfe4a58bf94b0011'/>
<id>urn:sha1:96033132be8976f0c9483a18dfe4a58bf94b0011</id>
<content type='text'>
</content>
</entry>
<entry>
<title>another lowercase DOI in an (unused?) script</title>
<updated>2021-07-13T18:55:56+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-07-13T18:55:50+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=c468443d325a5e091162cfb5f85697679e87eb72'/>
<id>urn:sha1:c468443d325a5e091162cfb5f85697679e87eb72</id>
<content type='text'>
</content>
</entry>
<entry>
<title>add cdx_collection.py python script (from scratch repo)</title>
<updated>2021-05-04T20:00:52+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-05-04T20:00:52+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=af50cb970644e968ed329f268181d507073b2789'/>
<id>urn:sha1:af50cb970644e968ed329f268181d507073b2789</id>
<content type='text'>
</content>
</entry>
<entry>
<title>doaj ingest request updates (from prod)</title>
<updated>2021-01-06T03:56:29+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-01-06T03:56:29+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=819a68b72c5aa330a3f7a6b91e1581163a62d9f3'/>
<id>urn:sha1:819a68b72c5aa330a3f7a6b91e1581163a62d9f3</id>
<content type='text'>
</content>
</entry>
</feed>
