author | Bryan Newbold <bnewbold@archive.org> | 2020-11-10 19:27:17 -0800 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2020-11-10 19:27:19 -0800 |
commit | ba035d8641a5e94d93448bb9a0cd56c7756d7055 (patch) | |
tree | 5f8d82eeba6c4dd37095b8c67a70217ffc8eba97 /notebooks | |
parent | 84fd65b58e33f87b544e2875d87daa941587c511 (diff) | |
download | fuzzycat-ba035d8641a5e94d93448bb9a0cd56c7756d7055.tar.gz fuzzycat-ba035d8641a5e94d93448bb9a0cd56c7756d7055.zip |
add support for key denylist
This filters out cluster rows whose resulting key appears in a given
text file (one key per line).
The intent is to exclude records with either poor or very generic
metadata from fuzzy matching. E.g., in many cases it is better to
not try matching "Letter to the Editor" to any record at all. This won't
always be the case: we might have journal, volume, issue, and page,
which would allow a match. For that reason, the denylist can be
specified on the command line.
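The filtering described above could look roughly like the sketch below. It is only an illustration, not the code from this commit: `load_key_denylist`, `filter_rows`, and `title_key` are hypothetical names, and the row layout (a dict with a "title" field) and the lowercase-title key are assumptions made for the example.

```python
import io


def load_key_denylist(lines):
    """Read one key per line; blank lines are ignored (hypothetical helper)."""
    return {line.strip() for line in lines if line.strip()}


def filter_rows(rows, key_func, denylist=None):
    """Yield rows whose computed key is not in the denylist.

    With no denylist given, every row passes through unchanged.
    """
    for row in rows:
        if denylist is not None and key_func(row) in denylist:
            continue
        yield row


def title_key(row):
    # Assumed key function for this example: the lowercased, stripped title.
    return row["title"].lower().strip()


# An in-memory stand-in for the denylist text file.
denylist = load_key_denylist(io.StringIO("letter to the editor\nerratum\n"))

rows = [
    {"title": "Letter to the Editor"},
    {"title": "Fuzzy Matching of Bibliographic Records"},
]

# Only the row with the non-generic title is kept.
kept = list(filter_rows(rows, title_key, denylist))
```

Keeping the denylist as a set makes each lookup constant time, which matters when clustering many rows. Leaving `denylist` as `None` disables filtering, matching the idea that the list is supplied only when wanted.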
Diffstat (limited to 'notebooks')
0 files changed, 0 insertions, 0 deletions