author Martin Czygan <martin.czygan@gmail.com> 2020-11-28 19:52:04 +0100
committer Martin Czygan <martin.czygan@gmail.com> 2020-11-28 19:52:04 +0100
commit fd8e41adb3af12d9fda198cc5918f5e411ac4530
tree f455717afc9ef60ae2c0acff19a72014e8ba7b62
parent dbe51fe929712f142c91ce3f4ffad600a2c2c8f6
update notes
-rw-r--r-- notes/2020_11_testruns.md 29
1 files changed, 29 insertions, 0 deletions
diff --git a/notes/2020_11_testruns.md b/notes/2020_11_testruns.md
index 077111f..a8386e6 100644
--- a/notes/2020_11_testruns.md
+++ b/notes/2020_11_testruns.md
@@ -52,4 +52,33 @@ The cluster size distribution is:
6451 18
5233 19
4865 20
+ ...
+```
+
+Preliminary case distribution:
+
+```
+ 802360 Miss.CONTRIB_INTERSECTION_EMPTY
+ 798412 OK.TITLE_AUTHOR_MATCH
+ 690479 OK.SLUG_TITLE_AUTHOR_MATCH
+ 680827 OK.DUMMY
+ 510317 Miss.YEAR
+ 346331 OK.FIGSHARE_VERSION
+ 277427 OK.ARXIV_VERSION
+ 241549 Miss.DATASET_DOI
+ 239922 Miss.BOOK_CHAPTER
+ 192773 OK.DATACITE_RELATED_ID
+ 163599 OK.TOKENIZED_AUTHORS
+ 135608 Miss.RELEASE_TYPE
+ 59939 Miss.COMPONENT
+ 58373 Miss.SUBTITLE
+ 48654 OK.PREPRINT_PUBLISHED
+ 45455 Miss.SHORT_TITLE
+ 2960 OK.DOI
+ 1301 Miss.APPENDIX
+ 934 Miss.BLACKLISTED_FRAGMENT
+ 653 Miss.BLACKLISTED
+ 172 Miss.TITLE_FILENAME
+ 21 Miss.NUM_DIFF
+ 1
```
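
The case distribution above lists one count per verification reason, each prefixed with `OK` or `Miss`. To get a rough sense of the overall split, the counts can be summed per prefix. The following is a minimal sketch, not part of fuzzycat: the function name `totals_by_status` and the `SAMPLE` constant are hypothetical, and the sample holds only four rows copied from the listing, so the printed shares do not describe the full run.

```python
from collections import Counter

# Four rows copied from the case distribution above (count, then Status.REASON).
# This is an illustrative subset, not the complete listing.
SAMPLE = """
 802360 Miss.CONTRIB_INTERSECTION_EMPTY
 798412 OK.TITLE_AUTHOR_MATCH
 510317 Miss.YEAR
   2960 OK.DOI
"""


def totals_by_status(text):
    """Sum counts per status prefix (the part of the label before the first dot)."""
    totals = Counter()
    for line in text.splitlines():
        # Tolerate rows pasted straight from a diff, which carry a leading "+".
        fields = line.lstrip("+ ").split()
        if len(fields) != 2 or not fields[0].isdigit():
            continue  # skip blank, elided ("...") or truncated rows
        count, label = fields
        status = label.split(".", 1)[0]
        totals[status] += int(count)
    return totals


totals = totals_by_status(SAMPLE)
grand_total = sum(totals.values())
for status, n in totals.most_common():
    print(f"{status}\t{n}\t{n / grand_total:.1%}")
```

Rows that do not consist of exactly a count and a label are skipped rather than guessed at, so the truncated final row of the listing would not contribute to either total.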