| | Commit message | Author | Age | Files | Lines |
---|---|---|---|---|---|
* | update fatcat_file SQL table schema, and add backfill notes | Bryan Newbold | 2021-12-07 | 1 | -1/+3 |
* | sql: grobid_refs table JSON as 'JSON' not 'JSONB' | Bryan Newbold | 2021-11-04 | 1 | -1/+1 |
| | I keep flip-flopping on this, but our disk usage is really large, and if 'JSON' is smaller than 'JSONB' in postgresql at all it is worth it. | | | | |
* | add grobid_refs and crossref_with_refs to sandcrawler-db SQL schema | Bryan Newbold | 2021-11-04 | 1 | -0/+21 |
* | sql: fixes to ingest_fileset_platform schema (from table creation) | Bryan Newbold | 2021-11-01 | 1 | -6/+6 |
* | sql fileset ingest table iteration | Bryan Newbold | 2021-10-15 | 1 | -12/+11 |
* | sql: initial ingest fileset table | Bryan Newbold | 2021-10-15 | 1 | -0/+38 |
* | sql: fix typo in CHECK statement | Bryan Newbold | 2021-10-15 | 1 | -1/+1 |
* | crossref DB proposal, and include in SQL schema | Bryan Newbold | 2021-06-02 | 1 | -0/+7 |
* | tweak html_meta SQL schema | Bryan Newbold | 2020-11-03 | 1 | -2/+2 |
* | html: start on SQL table | Bryan Newbold | 2020-11-03 | 1 | -0/+15 |
* | tweak pdf_meta SQL schema | Bryan Newbold | 2020-06-17 | 1 | -0/+26 |
* | sandcrawler schema: add MD5 index | Bryan Newbold | 2020-03-05 | 1 | -0/+1 |
* | pdftrio proposal and start on schema+kafka | Bryan Newbold | 2020-02-12 | 1 | -0/+13 |
* | clarify ingest result schema and semantics | Bryan Newbold | 2020-01-15 | 1 | -0/+16 |
* | move SQL schema to diesel migration pattern | Bryan Newbold | 2020-01-02 | 4 | -0/+156 |