aboutsummaryrefslogtreecommitdiffstats
BranchCommit messageAuthorAge
master2020-05_pubmed ingest notes (short)Bryan Newbold7 days
bnewbold-argsmake hbase_table and zookeeper_hosts CLI argsBryan Newbold2 years
bnewbold-backfillmake hbase_table and zookeeper_hosts CLI argsBryan Newbold2 years
 
 
AgeCommit messageAuthorFilesLines
7 days2020-05_pubmed ingest notes (short)HEADmasterBryan Newbold1-0/+10
7 dayscommit old notes on a one-off CDX table cleanupBryan Newbold1-0/+34
7 dayscommit old (2020-02) pdftrio commandsBryan Newbold1-0/+162
7 dayspdftrio SQL queriesBryan Newbold1-0/+65
7 daysSQL commands for re-trying PDF ingestsBryan Newbold1-0/+158
7 daysstart of RUNBOOK commandsBryan Newbold1-0/+44
7 daysunextracted PDF job dump commandBryan Newbold1-0/+16
7 dayspdfextract_tool fixes from prod usageBryan Newbold2-3/+6
7 daysfix tests for page0_height/widthBryan Newbold1-2/+2
7 dayspdfextract: fix pdf_extra key namesBryan Newbold1-2/+2
[...]
 
Clone
git@git.bnewbold.net:sandcrawler
https://git.bnewbold.net/sandcrawler
git://git.bnewbold.net/sandcrawler