aboutsummaryrefslogtreecommitdiffstats
BranchCommit messageAuthorAge
masterupdate notesBryan Newbold7 months
bnewbold-argsmake hbase_table and zookeeper_hosts CLI argsBryan Newbold13 months
bnewbold-backfillmake hbase_table and zookeeper_hosts CLI argsBryan Newbold13 months
 
 
AgeCommit messageAuthorFilesLines
2018-12-10update notesHEADmasterBryan Newbold3-1/+59
2018-12-10crank hbase GROBID worker memory usage downBryan Newbold1-1/+1
2018-12-10increase message size (kafka-grobid-hbase)Bryan Newbold1-0/+2
2018-12-10add python-snappy depBryan Newbold2-84/+96
2018-12-03ah, right, it's more like extract/3sec, not 30secBryan Newbold1-4/+4
2018-12-03tweak grobid worker producer settingsBryan Newbold1-2/+2
2018-12-03tweak kafka config significantlyBryan Newbold2-3/+18
2018-12-03more sentry tags when extractingBryan Newbold1-1/+6
2018-12-03improvements to Kafka GROBID worker loggingBryan Newbold2-11/+22
2018-12-01work around kafka topic/group mistakesBryan Newbold1-1/+1
[...]
 
Clone
git@git.bnewbold.net:sandcrawler
https://git.bnewbold.net/sandcrawler
git://git.bnewbold.net/sandcrawler