aboutsummaryrefslogtreecommitdiffstats
path: root/notebooks
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2020-11-10 19:27:17 -0800
committerBryan Newbold <bnewbold@archive.org>2020-11-10 19:27:19 -0800
commitba035d8641a5e94d93448bb9a0cd56c7756d7055 (patch)
tree5f8d82eeba6c4dd37095b8c67a70217ffc8eba97 /notebooks
parent84fd65b58e33f87b544e2875d87daa941587c511 (diff)
downloadfuzzycat-ba035d8641a5e94d93448bb9a0cd56c7756d7055.tar.gz
fuzzycat-ba035d8641a5e94d93448bb9a0cd56c7756d7055.zip
add support for key denylist
This is to filter out cluster rows where the resulting key is in a given text file (one key per line). The intent is to filter out records with either poor metadata, or very generic metadata, for fuzzy matching. Eg, in many cases it is better to just not try matching "Letter to the Editor" to any record. This won't always be the case; we might have journal, volume, issue, and page, which would allow a match. So this can be specified on the command line.
Diffstat (limited to 'notebooks')
0 files changed, 0 insertions, 0 deletions