aboutsummaryrefslogtreecommitdiffstats
path: root/sql
Commit message (Expand)AuthorAgeFilesLines
* dump_file_meta helperBryan Newbold2020-10-011-0/+12
* updated sandcrawler-db statsBryan Newbold2020-09-152-6/+346
* WIP weekly re-ingest scriptBryan Newbold2020-08-172-0/+97
* grobid+pdftext missing catch-up commandsBryan Newbold2020-08-054-10/+49
* commit stats from a couple weeks backBryan Newbold2020-08-051-0/+347
* sql stats commands updatesBryan Newbold2020-08-051-2/+2
* commented special modes for dump_unextracted_pdf.sqlBryan Newbold2020-06-251-1/+4
* pdftrio SQL queriesBryan Newbold2020-06-251-0/+65
* SQL commands for re-trying PDF ingestsBryan Newbold2020-06-251-0/+158
* unextracted PDF job dump commandBryan Newbold2020-06-251-0/+16
* tweak pdf_meta SQL schemaBryan Newbold2020-06-171-0/+26
* update sandcrawler stats for early mayBryan Newbold2020-05-041-0/+418
* more monitoring queriesBryan Newbold2020-03-301-5/+29
* make monitoring commands ingest_request local, not ingest_file_resultBryan Newbold2020-03-171-2/+2
* DOI prefix example queries (SQL)Bryan Newbold2020-03-101-3/+17
* helpful daily/weekly monitoring SQL queriesBryan Newbold2020-03-101-0/+94
* sandcrawler schema: add MD5 indexBryan Newbold2020-03-051-0/+1
* more SQL queriesBryan Newbold2020-03-021-0/+57
* recent sandcrawler-db / ingest stats (interesting)Bryan Newbold2020-02-242-0/+488
* dump_regrobid_pdf_petabox.sql scriptBryan Newbold2020-02-121-0/+15
* sandcrawler-db extra statsBryan Newbold2020-02-121-0/+42
* pdftrio proposal and start on schema+kafkaBryan Newbold2020-02-121-0/+13
* more random sandcrawler-db queriesBryan Newbold2020-02-032-32/+62
* more SQL commandsBryan Newbold2020-02-021-0/+15
* sql stats: typo fixBryan Newbold2020-01-281-1/+1
* sql howto: database dumpsBryan Newbold2020-01-281-0/+7
* clarify ingest result schema and semanticsBryan Newbold2020-01-151-0/+16
* database statsBryan Newbold2020-01-142-0/+289
* sql: more cool random queriesBryan Newbold2020-01-021-0/+5
* SQL docs update for diesel changeBryan Newbold2020-01-022-0/+48
* move SQL schema to diesel migration patternBryan Newbold2020-01-025-70/+157
* add some GROBID metadata schema docs to SQL schemaBryan Newbold2019-12-111-0/+11
* add note to CDX backfill script that we should be filtering (oops)Bryan Newbold2019-11-121-0/+1
* SQL stats and commands (mostly from sept 2019)Bryan Newbold2019-11-124-0/+96
* rename postgrest directory sqlBryan Newbold2019-09-239-0/+768