diff options
Diffstat (limited to 'extraction/TODO')
-rw-r--r-- | extraction/TODO | 6 |
1 files changed, 0 insertions, 6 deletions
diff --git a/extraction/TODO b/extraction/TODO deleted file mode 100644 index 3459752..0000000 --- a/extraction/TODO +++ /dev/null @@ -1,6 +0,0 @@ -- better test coverage (actually check coverage!) -- use pre-mapper command to filter down, eg, by status type? -- automation/docs for bundling virtualenv along -- think about speedups -- abstract CDX line reading and HBase stuff out into a common library -- actual GROBID_SERVER="http://wbgrp-svc096.us.archive.org:8070" |