author | Bryan Newbold <bnewbold@archive.org> | 2018-09-26 01:55:22 +0000 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2018-09-26 01:55:22 +0000 |
commit | 0bbe8e1f6689da846944d60a53e620adc2b7622b (patch) | |
tree | 38cac67e061e4948d9cf1d17f64c25abca486635 /python/title_slug_blacklist.txt | |
parent | 7159fdf1ec55a4c9c096afb5eb1ce57b9a51f1e8 (diff) | |
some progress on a crude grobid metadata filter
Diffstat (limited to 'python/title_slug_blacklist.txt')
l--------- | python/title_slug_blacklist.txt | 1 |
1 files changed, 1 insertions, 0 deletions
diff --git a/python/title_slug_blacklist.txt b/python/title_slug_blacklist.txt
new file mode 120000
index 0000000..5bca386
--- /dev/null
+++ b/python/title_slug_blacklist.txt
@@ -0,0 +1 @@
+../scalding/src/main/resources/slug-denylist.txt
\ No newline at end of file