Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | reduce: temp bug fix for line cutter | Martin Czygan | 2021-07-13 | 1 | -1/+5 |
| | | | | | | | | we wanted to trim whitespace at one point, because values contained the separator values; however this breaks with empty values; move back to not trimming values except for the newline, when requesting the last value; moving forward, we need to clean or reject dirty values or use a different delimiter | ||||
* | reduce: small tweaks | Martin Czygan | 2021-07-13 | 1 | -3/+4 |
| | |||||
* | wip: csl logging | Martin Czygan | 2021-07-13 | 1 | -1/+1 |
| | |||||
* | update docs | Martin Czygan | 2021-07-13 | 1 | -1/+7 |
| | |||||
* | reduce/schema: add csl | Martin Czygan | 2021-07-13 | 1 | -1/+7 |
| | |||||
* | wiki: include lang in encoded page title | Martin Czygan | 2021-07-13 | 1 | -7/+12 |
| | |||||
* | reduce: add todo | Martin Czygan | 2021-07-13 | 1 | -1/+3 |
| | |||||
* | mock out time.Now for tests | Martin Czygan | 2021-07-13 | 1 | -3/+6 |
| | |||||
* | reduce: log broken line only | Martin Czygan | 2021-07-10 | 1 | -1/+1 |
| | |||||
* | reduce: add key and indexed ts for exact matches | Martin Czygan | 2021-07-10 | 1 | -0/+2 |
| | |||||
* | reduce: ol, fuzzy, w/ unstructured | Martin Czygan | 2021-07-10 | 1 | -1/+1 |
| | |||||
* | release to unstructured stub | Martin Czygan | 2021-07-10 | 1 | -2/+2 |
| | |||||
* | reduce: open library id tweaks | Martin Czygan | 2021-07-10 | 1 | -5/+27 |
| | |||||
* | reduce: tweak wiki bref | Martin Czygan | 2021-07-10 | 1 | -4/+5 |
| | |||||
* | reduce: filter out duplicate wiki links | Martin Czygan | 2021-07-10 | 1 | -0/+8 |
| | |||||
* | wiki: use lowercase base32 of page title | Martin Czygan | 2021-07-09 | 1 | -2/+3 |
| | | | | * mostly case insensitive, same case as ident | ||||
* | reduce: use a base64 encoded title as key | Martin Czygan | 2021-07-09 | 1 | -1/+7 |
| | |||||
* | reduce: wiki doc in column 3 | Martin Czygan | 2021-07-09 | 1 | -1/+1 |
| | |||||
* | reduce: move batch size | Martin Czygan | 2021-07-09 | 1 | -8/+6 |
| | |||||
* | reduce: set default batch size | Martin Czygan | 2021-07-08 | 1 | -6/+8 |
| | |||||
* | simplify imports | Martin Czygan | 2021-07-08 | 1 | -1/+1 |
| | |||||
* | reduce: separate batch calls | Martin Czygan | 2021-07-08 | 1 | -18/+18 |
| | |||||
* | reduce: remove log line | Martin Czygan | 2021-07-06 | 1 | -1/+0 |
| | |||||
* | reduce: move to threaded versions | Martin Czygan | 2021-07-06 | 1 | -25/+30 |
| | |||||
* | add resource usage note | Martin Czygan | 2021-07-06 | 1 | -1/+1 |
| | |||||
* | wip: improve reduce performance | Martin Czygan | 2021-07-06 | 1 | -50/+8 |
| | |||||
* | wip: debug with stdlib json | Martin Czygan | 2021-07-05 | 1 | -1/+3 |
| | |||||
* | we need a safe encoder, not just a safe writer | Martin Czygan | 2021-07-05 | 1 | -1/+1 |
| | |||||
* | reduce: hard-code batch size for testing | Martin Czygan | 2021-07-05 | 1 | -0/+1 |
| | |||||
* | test-run: batch reduce processing for performance | Martin Czygan | 2021-07-05 | 1 | -4/+6 |
| | |||||
* | reduce: reduce allocations | Martin Czygan | 2021-07-05 | 1 | -5/+24 |
| | |||||
* | use most sensible file name | Martin Czygan | 2021-07-05 | 1 | -0/+599 |
* we have map, so we should have reduce |