From fd8e41adb3af12d9fda198cc5918f5e411ac4530 Mon Sep 17 00:00:00 2001
From: Martin Czygan
Date: Sat, 28 Nov 2020 19:52:04 +0100
Subject: update notes

---
 notes/2020_11_testruns.md | 29 +++++++++++++++++++++++++++++
 1 file changed, 29 insertions(+)

diff --git a/notes/2020_11_testruns.md b/notes/2020_11_testruns.md
index 077111f..a8386e6 100644
--- a/notes/2020_11_testruns.md
+++ b/notes/2020_11_testruns.md
@@ -52,4 +52,33 @@ The cluster size distribution is:
    6451 18
    5233 19
    4865 20
+    ...
+```
+
+Preliminary case distribution:
+
+```
+ 802360 Miss.CONTRIB_INTERSECTION_EMPTY
+ 798412 OK.TITLE_AUTHOR_MATCH
+ 690479 OK.SLUG_TITLE_AUTHOR_MATCH
+ 680827 OK.DUMMY
+ 510317 Miss.YEAR
+ 346331 OK.FIGSHARE_VERSION
+ 277427 OK.ARXIV_VERSION
+ 241549 Miss.DATASET_DOI
+ 239922 Miss.BOOK_CHAPTER
+ 192773 OK.DATACITE_RELATED_ID
+ 163599 OK.TOKENIZED_AUTHORS
+ 135608 Miss.RELEASE_TYPE
+  59939 Miss.COMPONENT
+  58373 Miss.SUBTITLE
+  48654 OK.PREPRINT_PUBLISHED
+  45455 Miss.SHORT_TITLE
+   2960 OK.DOI
+   1301 Miss.APPENDIX
+    934 Miss.BLACKLISTED_FRAGMENT
+    653 Miss.BLACKLISTED
+    172 Miss.TITLE_FILENAME
+     21 Miss.NUM_DIFF
+      1
 ```
-- 
cgit v1.2.3
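
Reviewer note: the case distribution added by this patch is easier to read once it is rolled up by status prefix. The following is a minimal sketch, assuming each row has the form `<count> <Status>.<Reason>` (as `sort | uniq -c` would produce; the patch does not say how the listing was generated). The names `CASES` and `tally_by_status` are hypothetical and not part of the notes or the project. Rows without a label, such as the final `1`, and the `...` elision are skipped.

```python
from collections import Counter

# Rows copied from the "Preliminary case distribution" listing in the patch.
CASES = """
 802360 Miss.CONTRIB_INTERSECTION_EMPTY
 798412 OK.TITLE_AUTHOR_MATCH
 690479 OK.SLUG_TITLE_AUTHOR_MATCH
 680827 OK.DUMMY
 510317 Miss.YEAR
 346331 OK.FIGSHARE_VERSION
 277427 OK.ARXIV_VERSION
 241549 Miss.DATASET_DOI
 239922 Miss.BOOK_CHAPTER
 192773 OK.DATACITE_RELATED_ID
 163599 OK.TOKENIZED_AUTHORS
 135608 Miss.RELEASE_TYPE
  59939 Miss.COMPONENT
  58373 Miss.SUBTITLE
  48654 OK.PREPRINT_PUBLISHED
  45455 Miss.SHORT_TITLE
   2960 OK.DOI
   1301 Miss.APPENDIX
    934 Miss.BLACKLISTED_FRAGMENT
    653 Miss.BLACKLISTED
    172 Miss.TITLE_FILENAME
     21 Miss.NUM_DIFF
      1
"""


def tally_by_status(lines):
    """Sum counts per status prefix (the text before the first dot)."""
    totals = Counter()
    for line in lines:
        parts = line.split()
        # Expect exactly "<count> <Status>.<Reason>"; skip anything else
        # (blank lines, the "..." elision, count-only rows with no label).
        if len(parts) != 2:
            continue
        count, label = parts
        if not count.isdigit() or "." not in label:
            continue
        status = label.split(".", 1)[0]
        totals[status] += int(count)
    return totals


totals = tally_by_status(CASES.splitlines())
grand_total = sum(totals.values())
for status, count in totals.most_common():
    print(f"{status:5s} {count:9d} {count / grand_total:6.1%}")
```

Grouping on the prefix only is a deliberate simplification: it treats `OK.DUMMY` like any other `OK` reason, which may or may not be what the notes intend.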