aboutsummaryrefslogtreecommitdiffstats
BranchCommit messageAuthorAge
master2020-05_pubmed ingest notes (short)Bryan Newbold3 weeks
bnewbold-argsmake hbase_table and zookeeper_hosts CLI argsBryan Newbold2 years
bnewbold-backfillmake hbase_table and zookeeper_hosts CLI argsBryan Newbold2 years
 
 
AgeCommit messageAuthorFilesLines
2020-06-252020-05_pubmed ingest notes (short)HEADmasterBryan Newbold1-0/+10
2020-06-25commit old notes on a one-off CDX table cleanupBryan Newbold1-0/+34
2020-06-25commit old (2020-02) pdftrio commandsBryan Newbold1-0/+162
2020-06-25pdftrio SQL queriesBryan Newbold1-0/+65
2020-06-25SQL commands for re-trying PDF ingestsBryan Newbold1-0/+158
2020-06-25start of RUNBOOK commandsBryan Newbold1-0/+44
2020-06-25unextracted PDF job dump commandBryan Newbold1-0/+16
2020-06-25pdfextract_tool fixes from prod usageBryan Newbold2-3/+6
2020-06-25fix tests for page0_height/widthBryan Newbold1-2/+2
2020-06-25pdfextract: fix pdf_extra key namesBryan Newbold1-2/+2
[...]
 
Clone
git@git.bnewbold.net:sandcrawler
https://git.bnewbold.net/sandcrawler
git://git.bnewbold.net/sandcrawler