<feed xmlns='http://www.w3.org/2005/Atom'>
<title>sandcrawler/sql/migrations, branch master</title>
<subtitle>[no description]</subtitle>
<id>https://git.bnewbold.net/sandcrawler/atom?h=master</id>
<link rel='self' href='https://git.bnewbold.net/sandcrawler/atom?h=master'/>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/'/>
<updated>2022-04-04T22:52:02+00:00</updated>
<entry>
<title>sql: add source/created index on ingest_request table</title>
<updated>2022-04-04T22:52:02+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2022-04-04T22:52:02+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=dadb26935c4d255c5a662f1e758bcf53864f7f95'/>
<id>urn:sha1:dadb26935c4d255c5a662f1e758bcf53864f7f95</id>
<content type='text'>
</content>
</entry>
<entry>
<title>update fatcat_file SQL table schema, and add backfill notes</title>
<updated>2021-12-08T03:10:23+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-12-02T03:06:33+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=3f01f73563f40869c82b7ad3e21c4183fdee8206'/>
<id>urn:sha1:3f01f73563f40869c82b7ad3e21c4183fdee8206</id>
<content type='text'>
</content>
</entry>
<entry>
<title>sql: grobid_refs table JSON as 'JSON' not 'JSONB'</title>
<updated>2021-11-05T00:19:52+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-11-02T03:05:16+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=4315b44a93ca31725b9b0a2a55c310725ac55efe'/>
<id>urn:sha1:4315b44a93ca31725b9b0a2a55c310725ac55efe</id>
<content type='text'>
I keep flip-flopping on this, but our disk usage is really large, and if
'JSON' is smaller than 'JSONB' in postgresql at all it is worth it.
</content>
</entry>
<entry>
<title>add grobid_refs and crossref_with_refs to sandcrawler-db SQL schema</title>
<updated>2021-11-05T00:19:52+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-30T01:38:17+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=59af5ddd0a9587eaf53b4f6965c0d6290295ce55'/>
<id>urn:sha1:59af5ddd0a9587eaf53b4f6965c0d6290295ce55</id>
<content type='text'>
</content>
</entry>
<entry>
<title>sql: fixes to ingest_fileset_platform schema (from table creation)</title>
<updated>2021-11-02T03:08:11+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-11-02T03:08:11+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=58acf442af343b4b74f2e206cea1e95145dce744'/>
<id>urn:sha1:58acf442af343b4b74f2e206cea1e95145dce744</id>
<content type='text'>
</content>
</entry>
<entry>
<title>sql fileset ingest table iteration</title>
<updated>2021-10-16T01:15:29+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-16T00:14:43+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=350a4e64aa60896391c1040d958b6b039ea3a79f'/>
<id>urn:sha1:350a4e64aa60896391c1040d958b6b039ea3a79f</id>
<content type='text'>
</content>
</entry>
<entry>
<title>sql: initial ingest fileset table</title>
<updated>2021-10-16T01:15:20+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-04T19:52:04+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=636ae0e44f6a4bc2e5325cdc8cbf7ae3a1f16d8b'/>
<id>urn:sha1:636ae0e44f6a4bc2e5325cdc8cbf7ae3a1f16d8b</id>
<content type='text'>
</content>
</entry>
<entry>
<title>sql: fix typo in CHECK statement</title>
<updated>2021-10-16T01:15:20+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-10-04T19:51:53+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=9b47798b2fd69fcf3f318bddc896e6342e7f8580'/>
<id>urn:sha1:9b47798b2fd69fcf3f318bddc896e6342e7f8580</id>
<content type='text'>
</content>
</entry>
<entry>
<title>crossref DB proposal, and include in SQL schema</title>
<updated>2021-06-02T07:26:51+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2021-06-02T07:26:51+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=87a9bb4ed81b503f03e6e77d6b082249523e67d4'/>
<id>urn:sha1:87a9bb4ed81b503f03e6e77d6b082249523e67d4</id>
<content type='text'>
</content>
</entry>
<entry>
<title>tweak html_meta SQL schema</title>
<updated>2020-11-04T00:24:16+00:00</updated>
<author>
<name>Bryan Newbold</name>
<email>bnewbold@archive.org</email>
</author>
<published>2020-11-04T00:24:16+00:00</published>
<link rel='alternate' type='text/html' href='https://git.bnewbold.net/sandcrawler/commit/?id=644c6abdb424a3759e06df6b2541d41fb353e95c'/>
<id>urn:sha1:644c6abdb424a3759e06df6b2541d41fb353e95c</id>
<content type='text'>
</content>
</entry>
</feed>
