diff options
author | Martin Czygan <martin.czygan@gmail.com> | 2020-11-28 16:51:40 +0100 |
---|---|---|
committer | Martin Czygan <martin.czygan@gmail.com> | 2020-11-28 16:51:40 +0100 |
commit | 834a23edfa931b12648dbff31d53ea98d272c125 (patch) | |
tree | eea0e31e4b9a1fdac930b6a1b76ad87ff72ebc3e | |
parent | 31f9d24b4dad839527daab3ded3d31f8be81f498 (diff) | |
download | fuzzycat-834a23edfa931b12648dbff31d53ea98d272c125.tar.gz fuzzycat-834a23edfa931b12648dbff31d53ea98d272c125.zip |
note on cluster size distribution
-rw-r--r-- | notes/2020_11_testruns.md | 24 |
1 files changed, 24 insertions, 0 deletions
diff --git a/notes/2020_11_testruns.md b/notes/2020_11_testruns.md index 31c292c..077111f 100644 --- a/notes/2020_11_testruns.md +++ b/notes/2020_11_testruns.md @@ -29,3 +29,27 @@ $ time zstdcat -T0 cluster_tsandcrawler_min_cluster_size_2.json.zst | \ zstd -T0 -c9 > cluster_tsandcrawler_min_cluster_size_2_verify.tsv.zst ``` +The cluster size distribution is: + +``` +9086522 2 +1486742 3 + 506125 4 + 211335 5 + 126678 6 + 67592 7 + 47085 8 + 32587 9 + 23975 10 + 19153 11 + 16318 12 + 12167 100 + 12051 13 + 10345 14 + 8687 15 + 7418 16 + 6655 17 + 6451 18 + 5233 19 + 4865 20 +``` |