aboutsummaryrefslogtreecommitdiffstats
path: root/python/scripts/cdx_collection.py
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2021-10-29 17:08:59 -0700
committerBryan Newbold <bnewbold@archive.org>2021-11-04 17:19:52 -0700
commit341ad36e99d2d1a2f0984fecac857a961bf26fb8 (patch)
tree49799e64d374c1c70af09e4ae64b282fa1bc8351 /python/scripts/cdx_collection.py
parent8723650a87155080984c2e80f9cbf502a42f4fa5 (diff)
downloadsandcrawler-341ad36e99d2d1a2f0984fecac857a961bf26fb8.tar.gz
sandcrawler-341ad36e99d2d1a2f0984fecac857a961bf26fb8.zip
iterated GROBID citation cleaning and processing
Switched to using just 'key'/'id' for downstream matching.
Diffstat (limited to 'python/scripts/cdx_collection.py')
0 files changed, 0 insertions, 0 deletions