path: root/notes/url_pattern_heuristic_backfill.txt
author Bryan Newbold <bnewbold@archive.org> 2021-11-03 18:41:30 -0700
committer Bryan Newbold <bnewbold@archive.org> 2021-11-04 17:19:52 -0700
commit 1f078fe94a5cf5322527b97dcdf0cb054e0c7540
tree 7bc5419530c16f9fc64f112113942398ec62eb9a /notes/url_pattern_heuristic_backfill.txt
parent 8577d50b644dd45bce5275675eed4d43bb816b67
grobid: handle weird whitespace unstructured from crossref
See also: https://github.com/kermitt2/grobid/issues/849
Diffstat (limited to 'notes/url_pattern_heuristic_backfill.txt')
0 files changed, 0 insertions, 0 deletions