index
:
sandcrawler
bnewbold-args
bnewbold-backfill
bnewbold-persist-grobid-errors
bnewbold-refactor-loggging
master
trawler
[no description]
about
summary
refs
log
tree
commit
diff
stats
log msg
author
committer
range
path:
root
/
python
/
grobid_tool.py
Commit message (
Expand
)
Author
Age
Files
Lines
*
switch default kafka-broker host from wbgrp-svc263 to wbgrp-svc350
Bryan Newbold
2022-05-03
1
-1
/
+1
*
grobid_tool: helper to process a single file
Bryan Newbold
2021-11-10
1
-0
/
+15
*
initial crossref-refs via GROBID helper routine
Bryan Newbold
2021-11-04
1
-1
/
+20
*
remove grobid2json helper file, replace with grobid_tei_xml
Bryan Newbold
2021-10-27
1
-2
/
+4
*
make fmt (black 21.9b0)
Bryan Newbold
2021-10-27
1
-41
/
+51
*
start handling trivial lint cleanups: unused imports, 'is None', etc
Bryan Newbold
2021-10-26
1
-1
/
+0
*
make fmt
Bryan Newbold
2021-10-26
1
-30
/
+34
*
python: isort all imports
Bryan Newbold
2021-10-26
1
-2
/
+2
*
better default CLI output (show usage)
Bryan Newbold
2020-10-29
1
-1
/
+1
*
rename KafkaGrobidSink -> KafkaCompressSink
Bryan Newbold
2020-06-16
1
-1
/
+1
*
batch/multiprocess for ZipfilePusher
Bryan Newbold
2020-04-16
1
-2
/
+8
*
more ftp status 226 support
Bryan Newbold
2020-01-14
1
-5
/
+13
*
commit grobid_tool transform mode
Bryan Newbold
2019-12-22
1
-0
/
+27
*
refactor: improve argparse usage
Bryan Newbold
2019-12-18
1
-4
/
+8
*
grobid_tool: don't wrap multiprocess if we don't need to
Bryan Newbold
2019-10-04
1
-2
/
+4
*
grobid-output-pg, not grobid-output-json
Bryan Newbold
2019-10-04
1
-4
/
+2
*
grobid_tool: don't always insert multi wrapper
Bryan Newbold
2019-10-04
1
-6
/
+13
*
grobid_tool.py example usage in docstring
Bryan Newbold
2019-10-02
1
-0
/
+6
*
more counts and bugfixes in grobid_tool
Bryan Newbold
2019-09-26
1
-1
/
+1
*
small improvements to GROBID tool
Bryan Newbold
2019-09-26
1
-2
/
+6
*
lots of grobid tool implementation (still WIP)
Bryan Newbold
2019-09-26
1
-0
/
+87