<feed xmlns='http://www.w3.org/2005/Atom'>
<title>sandcrawler/proposals, branch bnewbold-refactor-loggging</title>
<subtitle>[no description]</subtitle>
<id>https://git.bnewbold.net/sandcrawler/atom?h=bnewbold-refactor-loggging</id>
<link rel='self' href='https://git.bnewbold.net/sandcrawler/atom?h=bnewbold-refactor-loggging'/>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/'/>
<updated>2022-01-28T01:55:40+00:00</updated>
<entry>
<title>'trawling' proposal (in progress)</title>
<updated>2022-01-28T01:55:40+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2022-01-28T01:55:40+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=6b59e1f4f08662ac9e6c3adb731af31e42f894a6'/>
<id>urn:sha1:6b59e1f4f08662ac9e6c3adb731af31e42f894a6</id>
<content type='text'>
</content>
</entry>
<entry>
<title>codespell fixes in proposals</title>
<updated>2021-11-25T00:01:51+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-11-25T00:01:47+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=d93d542adf9d26633b0f3cfa361277ca677c46f3'/>
<id>urn:sha1:d93d542adf9d26633b0f3cfa361277ca677c46f3</id>
<content type='text'>
</content>
</entry>
<entry>
<title>sql: grobid_refs table JSON as 'JSON' not 'JSONB'</title>
<updated>2021-11-05T00:19:52+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-11-02T03:05:16+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=4315b44a93ca31725b9b0a2a55c310725ac55efe'/>
<id>urn:sha1:4315b44a93ca31725b9b0a2a55c310725ac55efe</id>
<content type='text'>
I keep flip-flopping on this, but our disk usage is really large, and if
'JSON' is smaller than 'JSONB' in postgresql at all it is worth it.
</content>
</entry>
<entry>
<title>update grobid refs proposal</title>
<updated>2021-11-05T00:19:52+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-30T01:36:03+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=16f4b7f45ae8bdcd4018850efe164ed19069e9fe'/>
<id>urn:sha1:16f4b7f45ae8bdcd4018850efe164ed19069e9fe</id>
<content type='text'>
</content>
</entry>
<entry>
<title>initial proposal for GROBID refs table and pipeline</title>
<updated>2021-11-05T00:19:52+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-29T19:16:22+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=5267f3c778b1bc70830be7f3a45fda52c23477bd'/>
<id>urn:sha1:5267f3c778b1bc70830be7f3a45fda52c23477bd</id>
<content type='text'>
</content>
</entry>
<entry>
<title>sql: fixes to ingest_fileset_platform schema (from table creation)</title>
<updated>2021-11-02T03:08:11+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-11-02T03:08:11+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=58acf442af343b4b74f2e206cea1e95145dce744'/>
<id>urn:sha1:58acf442af343b4b74f2e206cea1e95145dce744</id>
<content type='text'>
</content>
</entry>
<entry>
<title>commit SPN account changes</title>
<updated>2021-10-16T01:17:32+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-16T01:17:32+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=12d041b781912dc376444198c920ade2d6cee7c8'/>
<id>urn:sha1:12d041b781912dc376444198c920ade2d6cee7c8</id>
<content type='text'>
</content>
</entry>
<entry>
<title>persist support for ingest platform table, using existing persist worker</title>
<updated>2021-10-16T01:15:29+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-16T01:04:39+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=bb06c833f500fbd37579ffb4aa1c53dc0d1e9c96'/>
<id>urn:sha1:bb06c833f500fbd37579ffb4aa1c53dc0d1e9c96</id>
<content type='text'>
</content>
</entry>
<entry>
<title>document passing back platform_base_url</title>
<updated>2021-10-16T01:15:29+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-16T00:14:28+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=0a6e449317278e95c3c706aaee19ffb9dc00bebc'/>
<id>urn:sha1:0a6e449317278e95c3c706aaee19ffb9dc00bebc</id>
<content type='text'>
</content>
</entry>
<entry>
<title>filesets: iteration of implementation and docs</title>
<updated>2021-10-16T01:15:29+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-15T20:17:23+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=6cccac03451f46cb59897871e6631debca558771'/>
<id>urn:sha1:6cccac03451f46cb59897871e6631debca558771</id>
<content type='text'>
</content>
</entry>
</feed>
