Commit message | Author | Age | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
* | update stats | Martin Czygan | 2020-11-28 | 1 | -21/+21 | |
| | ||||||
* | update stats | Martin Czygan | 2020-11-28 | 1 | -21/+21 | |
| | ||||||
* | update notes | Martin Czygan | 2020-11-28 | 1 | -0/+29 | |
| | ||||||
* | note on cluster size distribution | Martin Czygan | 2020-11-28 | 1 | -0/+24 | |
| | ||||||
* | subtitle: default to list | Martin Czygan | 2020-11-27 | 1 | -0/+31 | |
| | ||||||
* | cleanup | Martin Czygan | 2020-11-24 | 5 | -401/+0 | |
| | ||||||
* | cluster notes | Martin Czygan | 2020-11-11 | 1 | -0/+19 | |
| | ||||||
* | README should not be notes | Martin Czygan | 2020-11-05 | 1 | -0/+197 | |
| | ||||||
* | cleanup notes | Martin Czygan | 2020-10-31 | 1 | -3/+0 | |
| | ||||||
* | update workflow notes | Martin Czygan | 2020-10-31 | 1 | -0/+6 | |
| | ||||||
* | note on workflow | Martin Czygan | 2020-10-31 | 1 | -3/+3 | |
| | ||||||
* | move around notes | Martin Czygan | 2020-10-31 | 5 | -2258/+1 | |
| | ||||||
* | update notes on cluster, nb | Martin Czygan | 2020-10-22 | 1 | -1/+47 | |
| | ||||||
* | update notes on clustering | Martin Czygan | 2020-10-22 | 1 | -0/+18 | |
| | ||||||
* | update cluster notes | Martin Czygan | 2020-10-22 | 1 | -0/+27 | |
| | ||||||
* | notes: clustering | Martin Czygan | 2020-10-22 | 1 | -0/+11 | |
| | ||||||
* | cluster variants | Martin Czygan | 2020-10-21 | 1 | -0/+54 | |
| | ||||||
* | update various docs; start data issue log | Martin Czygan | 2020-09-03 | 2 | -1/+1 | |
| | ||||||
* | add notes on abbrevs | Martin Czygan | 2020-08-15 | 2 | -0/+2260 | |
| | ||||||
* | update plan | Martin Czygan | 2020-08-14 | 1 | -0/+5 | |
| | ||||||
* | note on optimization: marisa-trie | Martin Czygan | 2020-08-12 | 1 | -0/+1 | |
| | Currently, the JSON mapping is 172M, turning this into a dict takes a bit, plus consumes GBs of memory. For exact lookups, we might want to use marisa-trie: "String data in a MARISA-trie may take up to 50x-100x less memory than in a standard Python dict; the raw lookup speed is comparable; trie also provides fast advanced methods like prefix search." | |||||
* | add notes/todo | Martin Czygan | 2020-08-12 | 1 | -0/+17 | |
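
The marisa-trie note in the log above (2020-08-12) suggests replacing a large in-memory dict with a trie for exact lookups. A minimal sketch of that idea follows. It assumes the JSON mapping is a flat string-to-string object, which the log does not confirm; the sample keys and the `build_lookup` helper are hypothetical. It uses `marisa_trie.BytesTrie` from the third-party `marisa-trie` package and falls back to a plain dict when the package is not installed, so the fallback path gives no memory savings.

```python
import json

try:
    import marisa_trie  # third-party package: pip install marisa-trie
except ImportError:
    marisa_trie = None


def build_lookup(mapping):
    """Return a function mapping key -> value (or None) for exact lookups.

    `mapping` is assumed to be a dict of str -> str (an assumption, the
    real structure of the JSON file is not shown in the log).
    """
    if marisa_trie is None:
        # Fallback so the sketch still runs; behaves like the original dict.
        return mapping.get

    # BytesTrie stores bytes payloads under unicode keys.
    items = [(key, value.encode("utf-8")) for key, value in mapping.items()]
    trie = marisa_trie.BytesTrie(items)

    def lookup(key):
        values = trie.get(key)  # list of bytes payloads, or None if absent
        return values[0].decode("utf-8") if values else None

    return lookup


# Hypothetical sample standing in for the 172M JSON mapping.
sample = json.loads(
    '{"J. Chem. Phys.": "Journal of Chemical Physics",'
    ' "Phys. Rev.": "Physical Review"}'
)
lookup = build_lookup(sample)
print(lookup("Phys. Rev."))
```

Note that this sketch still parses the whole JSON into a dict once in order to build the trie. To avoid paying that cost on every start, the trie could be built once and written to disk with `trie.save(path)`, then reopened later with `load` or `mmap`.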