diff options
Diffstat (limited to 'projects/README.md')
-rw-r--r-- | projects/README.md | 17 |
1 files changed, 17 insertions, 0 deletions
diff --git a/projects/README.md b/projects/README.md new file mode 100644 index 0000000..bfbbaef --- /dev/null +++ b/projects/README.md @@ -0,0 +1,17 @@ +# Datasets + +Example datasets for fuzzycat, fatcat fuzzy matching utilities. + +* repo: [fuzycat](https://github.com/miku/fuzzycat) +* data: [fuzzycat_samples](https://archive.org/details/fuzzycat_samples) + +## Grobid References (grobid_refs) + +## Title list (titlelist) + +## Name only containers (name_only_containers) + +## OAI harvest metadata + +* [https://archive.org/details/oai_harvest_20200215](https://archive.org/details/oai_harvest_20200215) +* [oai.ndjson.zst](https://archive.org/download/oai_harvest_20200215/oai.ndjson.zst) |