diff options
author | Bryan Newbold <bnewbold@archive.org> | 2020-02-12 19:40:55 -0800 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2020-02-12 19:42:43 -0800 |
commit | 94912e739c51d2fa4d5f9de878d0b0f0544a4459 (patch) | |
tree | af7803bee388beba7dd6dce2113e3632284537ac /proposals | |
parent | 6b3ce3169847a16fe6c0ab00f3a8af8b8ad099ab (diff) | |
download | sandcrawler-94912e739c51d2fa4d5f9de878d0b0f0544a4459.tar.gz sandcrawler-94912e739c51d2fa4d5f9de878d0b0f0544a4459.zip |
pdftrio basic python code
This is basically just a copy/paste of GROBID code, only simpler!
Diffstat (limited to 'proposals')
-rw-r--r-- | proposals/20200207_pdftrio.md | 4 |
1 files changed, 2 insertions, 2 deletions
diff --git a/proposals/20200207_pdftrio.md b/proposals/20200207_pdftrio.md index b1b09f9..78d2d6c 100644 --- a/proposals/20200207_pdftrio.md +++ b/proposals/20200207_pdftrio.md @@ -71,8 +71,8 @@ Output that goes in Kafka topic: bert_model (string) image_model (string) linear_model (string) - timing - ... (might be added?) + timing (might be added?) + ... file_meta sha1hex ... |