diff options
Diffstat (limited to 'python')
-rwxr-xr-x | python/kafka_grobid.py | 3 |
1 files changed, 3 insertions, 0 deletions
diff --git a/python/kafka_grobid.py b/python/kafka_grobid.py index cde7a2d..8d1bd2e 100755 --- a/python/kafka_grobid.py +++ b/python/kafka_grobid.py @@ -1,5 +1,8 @@ #!/usr/bin/env python3 """ +DEPRECATED: this worker uses old kafka topics and an old schema. Use +`sandcrawler_worker.py` instead. + Kafka worker that does GROBID extraction from one queue and into another. Based on the ungrobided Hadoop job code. Does not talk to HBase at all, just |