From 3e5aa503d69f6090698d55e1f03648b4628be069 Mon Sep 17 00:00:00 2001 From: Martin Czygan Date: Thu, 22 Oct 2020 09:54:18 +0200 Subject: notes: clustering --- notes/Clustering.md | 11 +++++++++++ 1 file changed, 11 insertions(+) create mode 100644 notes/Clustering.md (limited to 'notes') diff --git a/notes/Clustering.md b/notes/Clustering.md new file mode 100644 index 0000000..754852d --- /dev/null +++ b/notes/Clustering.md @@ -0,0 +1,11 @@ +# Clustering + +Original dataset: + +``` +$ sha1sum release_export_expanded.json.zst + +$ zstdcat -T0 release_export_expanded.json.zst | wc -l +``` + + -- cgit v1.2.3