author Martin Czygan <martin.czygan@gmail.com> 2020-11-28 16:51:40 +0100
committer Martin Czygan <martin.czygan@gmail.com> 2020-11-28 16:51:40 +0100
commit 834a23edfa931b12648dbff31d53ea98d272c125
tree eea0e31e4b9a1fdac930b6a1b76ad87ff72ebc3e /notes
parent 31f9d24b4dad839527daab3ded3d31f8be81f498
note on cluster size distribution
Diffstat (limited to 'notes')
-rw-r--r-- notes/2020_11_testruns.md 24
1 files changed, 24 insertions, 0 deletions
diff --git a/notes/2020_11_testruns.md b/notes/2020_11_testruns.md
index 31c292c..077111f 100644
--- a/notes/2020_11_testruns.md
+++ b/notes/2020_11_testruns.md
@@ -29,3 +29,27 @@ $ time zstdcat -T0 cluster_tsandcrawler_min_cluster_size_2.json.zst | \
zstd -T0 -c9 > cluster_tsandcrawler_min_cluster_size_2_verify.tsv.zst
```
+The cluster size distribution (number of clusters, then cluster size; rows sorted by count) is:
+
+```
+9086522 2
+1486742 3
+ 506125 4
+ 211335 5
+ 126678 6
+ 67592 7
+ 47085 8
+ 32587 9
+ 23975 10
+ 19153 11
+ 16318 12
+ 12167 100
+ 12051 13
+ 10345 14
+ 8687 15
+ 7418 16
+ 6655 17
+ 6451 18
+ 5233 19
+ 4865 20
+```
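
To read the table above as proportions rather than raw counts, a minimal sketch follows. It assumes each row is `<count> <cluster size>`, as the ordering of the rows suggests, and it only sees the 20 rows listed in the note, so the shares are relative to those rows and not to the full dataset. The names `ROWS`, `parse_distribution` and `share_by_size` are hypothetical helpers introduced for illustration; they are not part of fuzzycat.

```python
from typing import Dict

# Rows copied from the note above, assumed to be "<count> <cluster size>".
ROWS = """\
9086522 2
1486742 3
 506125 4
 211335 5
 126678 6
  67592 7
  47085 8
  32587 9
  23975 10
  19153 11
  16318 12
  12167 100
  12051 13
  10345 14
   8687 15
   7418 16
   6655 17
   6451 18
   5233 19
   4865 20
"""


def parse_distribution(text: str) -> Dict[int, int]:
    """Map cluster size -> number of clusters of that size."""
    dist: Dict[int, int] = {}
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 2:
            continue  # skip blank or malformed lines
        count, size = int(parts[0]), int(parts[1])
        dist[size] = count
    return dist


def share_by_size(dist: Dict[int, int]) -> Dict[int, float]:
    """Fraction of the listed clusters per cluster size, ordered by size."""
    total = sum(dist.values())
    return {size: count / total for size, count in sorted(dist.items())}


if __name__ == "__main__":
    dist = parse_distribution(ROWS)
    for size, share in share_by_size(dist).items():
        print(f"{size:>4} {dist[size]:>8} {share:7.2%}")
```

Sorting by cluster size instead of by count makes the row for size 100 stand out at the end of the output, where it breaks the otherwise decreasing trend.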