diff options
author | Martin Czygan <martin.czygan@gmail.com> | 2020-11-28 19:52:04 +0100 |
---|---|---|
committer | Martin Czygan <martin.czygan@gmail.com> | 2020-11-28 19:52:04 +0100 |
commit | fd8e41adb3af12d9fda198cc5918f5e411ac4530 (patch) | |
tree | f455717afc9ef60ae2c0acff19a72014e8ba7b62 /notes | |
parent | dbe51fe929712f142c91ce3f4ffad600a2c2c8f6 (diff) | |
download | fuzzycat-fd8e41adb3af12d9fda198cc5918f5e411ac4530.tar.gz fuzzycat-fd8e41adb3af12d9fda198cc5918f5e411ac4530.zip |
update notes
Diffstat (limited to 'notes')
-rw-r--r-- | notes/2020_11_testruns.md | 29 |
1 files changed, 29 insertions, 0 deletions
diff --git a/notes/2020_11_testruns.md b/notes/2020_11_testruns.md index 077111f..a8386e6 100644 --- a/notes/2020_11_testruns.md +++ b/notes/2020_11_testruns.md @@ -52,4 +52,33 @@ The cluster size distribution is: 6451 18 5233 19 4865 20 + ... +``` + +Preliminary case distribution: + +``` + 802360 Miss.CONTRIB_INTERSECTION_EMPTY + 798412 OK.TITLE_AUTHOR_MATCH + 690479 OK.SLUG_TITLE_AUTHOR_MATCH + 680827 OK.DUMMY + 510317 Miss.YEAR + 346331 OK.FIGSHARE_VERSION + 277427 OK.ARXIV_VERSION + 241549 Miss.DATASET_DOI + 239922 Miss.BOOK_CHAPTER + 192773 OK.DATACITE_RELATED_ID + 163599 OK.TOKENIZED_AUTHORS + 135608 Miss.RELEASE_TYPE + 59939 Miss.COMPONENT + 58373 Miss.SUBTITLE + 48654 OK.PREPRINT_PUBLISHED + 45455 Miss.SHORT_TITLE + 2960 OK.DOI + 1301 Miss.APPENDIX + 934 Miss.BLACKLISTED_FRAGMENT + 653 Miss.BLACKLISTED + 172 Miss.TITLE_FILENAME + 21 Miss.NUM_DIFF + 1 ``` |