aboutsummaryrefslogtreecommitdiffstats
path: root/python_hadoop/extraction_cdx_grobid.py
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2020-01-28 18:55:36 -0800
committerBryan Newbold <bnewbold@archive.org>2020-01-28 18:55:39 -0800
commit08377ca3fdb7103ce0e0a98f7ae9e2baa39febf8 (patch)
tree7518d73712d36bda2e4c7fda9a3afe923e9c91de /python_hadoop/extraction_cdx_grobid.py
parente0c2cc4b1a41b5de40c9e3adc9cba36d4dc93ed1 (diff)
downloadsandcrawler-08377ca3fdb7103ce0e0a98f7ae9e2baa39febf8.tar.gz
sandcrawler-08377ca3fdb7103ce0e0a98f7ae9e2baa39febf8.zip
grobid worker: always set a key in response
We have key-based compaction enabled for the GROBID output topic. This means it is an error to public to that topic without a key set. Hopefully this change will end these errors, which look like: KafkaError{code=INVALID_MSG,val=2,str="Broker: Invalid message"}
Diffstat (limited to 'python_hadoop/extraction_cdx_grobid.py')
0 files changed, 0 insertions, 0 deletions