aboutsummaryrefslogtreecommitdiffstats
BranchCommit messageAuthorAge
masternew/additional GWB CDX filter scriptsBryan Newbold6 days
bnewbold-argsmake hbase_table and zookeeper_hosts CLI argsBryan Newbold17 months
bnewbold-backfillmake hbase_table and zookeeper_hosts CLI argsBryan Newbold17 months
 
 
AgeCommit messageAuthorFilesLines
6 daysnew/additional GWB CDX filter scriptsHEADmasterBryan Newbold7-0/+142
2019-10-04we do actually want consolidateHeader=2, not 1Bryan Newbold2-4/+4
2019-10-04remove any trailing newlineBryan Newbold1-2/+2
2019-10-04grobid: consolidateHeaders typoBryan Newbold1-1/+1
2019-10-04grobid_tool: don't wrap multiprocess if we don't need toBryan Newbold1-2/+4
2019-10-04disable citation consolidation by defaultBryan Newbold1-1/+1
2019-10-04grobid-output-pg, not grobid-output-jsonBryan Newbold1-4/+2
2019-10-04grobid_tool: don't always insert multi wrapperBryan Newbold1-6/+13
2019-10-04grobid2json: language_codeBryan Newbold2-1/+7
2019-10-04fix GROBID POST flagsBryan Newbold1-1/+3
[...]
 
Clone
git@git.bnewbold.net:sandcrawler
https://git.bnewbold.net/sandcrawler
git://git.bnewbold.net/sandcrawler