aboutsummaryrefslogtreecommitdiffstats
path: root/python/grobid2json.py
Commit message (Collapse)AuthorAgeFilesLines
* remove grobid2json helper file, replace with grobid_tei_xmlBryan Newbold2021-10-271-210/+0
|
* make fmt (black 21.9b0)Bryan Newbold2021-10-271-1/+2
|
* make fmtBryan Newbold2021-10-261-9/+3
|
* python: isort all importsBryan Newbold2021-10-261-2/+2
|
* grobid2json: fix deprecation warningBryan Newbold2020-10-271-1/+1
|
* update grobid2json with type annotationsBryan Newbold2020-06-171-94/+110
|
* be more parsimonious with GROBID metadataBryan Newbold2020-01-021-1/+16
| | | | | Because these are getting persisted in database (as well as kafka), don't write out empty keys.
* refactor: sort keys in JSON outputBryan Newbold2019-12-181-1/+2
| | | | This makes debugging by tailing Kafka topics a lot more readable
* refactor: improve argparse usageBryan Newbold2019-12-181-0/+1
| | | | | use ArgumentDefaultsHelpFormatter and add help messages to all sub-commands
* grobid2json: make lang detection flexibleBryan Newbold2019-11-131-1/+2
|
* remove spurious debug print from grobid2jsonBryan Newbold2019-10-221-1/+1
|
* remove any trailing newlineBryan Newbold2019-10-041-2/+2
|
* grobid2json: language_codeBryan Newbold2019-10-041-0/+5
|
* have grobid2json extract full names and affiliationsBryan Newbold2019-10-021-5/+27
|
* grobid2json: extract fatcat identifierBryan Newbold2019-09-201-1/+5
|
* update grobid2json to include given_name/surnameBryan Newbold2019-05-131-3/+7
|
* rename ./mapreduce to ./pythonBryan Newbold2018-08-241-0/+146