Commit message | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | add ZippyWayback reducer | Martin Czygan | 2021-07-15 | 3 | -54/+114 |
* | mapper: add cdxu | Martin Czygan | 2021-07-15 | 2 | -0/+22 |
* | map: add another mapper | Martin Czygan | 2021-07-15 | 2 | -3/+17 |
* | update docs | Martin Czygan | 2021-07-14 | 2 | -11/+11 |
* | reduce: add test | Martin Czygan | 2021-07-14 | 2 | -18/+41 |
* | reduce: add todo | Martin Czygan | 2021-07-14 | 1 | -0/+2 |
* | v0.1.39 | Martin Czygan | 2021-07-14 | 1 | -1/+1 |
* | reduce: add csl field | Martin Czygan | 2021-07-14 | 4 | -8/+72 |
* | reduce: fix off-by-one error | Martin Czygan | 2021-07-14 | 2 | -2/+2 |
* | reduce: temp bug fix for line cutter | Martin Czygan | 2021-07-13 | 2 | -32/+61 |
* | v0.1.38 | Martin Czygan | 2021-07-13 | 1 | -1/+1 |
* | reduce: small tweaks | Martin Czygan | 2021-07-13 | 2 | -6/+7 |
* | fix typo | Martin Czygan | 2021-07-13 | 1 | -1/+1 |
* | wip: csl logging | Martin Czygan | 2021-07-13 | 1 | -1/+1 |
* | update docs | Martin Czygan | 2021-07-13 | 1 | -1/+7 |
* | reduce/schema: add csl | Martin Czygan | 2021-07-13 | 3 | -5/+70 |
* | wiki: include lang in encoded page title | Martin Czygan | 2021-07-13 | 2 | -8/+18 |
* | reduce: add todo | Martin Czygan | 2021-07-13 | 1 | -1/+3 |
* | separate slugify functions | Martin Czygan | 2021-07-13 | 4 | -28/+39 |
* | mock out time.Now for tests | Martin Czygan | 2021-07-13 | 4 | -1034/+1041 |
* | reduce: log broken line only | Martin Czygan | 2021-07-10 | 1 | -1/+1 |
* | reduce: add key and indexed ts for exact matches | Martin Czygan | 2021-07-10 | 1 | -0/+2 |
* | batch: drop logging | Martin Czygan | 2021-07-10 | 1 | -4/+0 |
* | batch: log batch size | Martin Czygan | 2021-07-10 | 1 | -1/+1 |
* | reduce: short circuit large groups | Martin Czygan | 2021-07-10 | 1 | -2/+12 |
* | schema: prefer isbn13 | Martin Czygan | 2021-07-10 | 1 | -1/+5 |
* | schema: render isbn as well | Martin Czygan | 2021-07-10 | 1 | -1/+7 |
* | reduce: ol, fuzzy, w/ unstructured | Martin Czygan | 2021-07-10 | 1 | -1/+1 |
* | schema: add test | Martin Czygan | 2021-07-10 | 2 | -0/+20 |
* | schema: flesh our unstructured rendering | Martin Czygan | 2021-07-10 | 2 | -0/+56 |
* | release to unstructured stub | Martin Czygan | 2021-07-10 | 3 | -2/+84 |
* | reduce: open library id tweaks | Martin Czygan | 2021-07-10 | 1 | -5/+27 |
* | reduce: tweak wiki bref | Martin Czygan | 2021-07-10 | 1 | -4/+5 |
* | reduce: filter out duplicate wiki links | Martin Czygan | 2021-07-10 | 1 | -0/+8 |
* | wiki: use lowercase base32 of page title | Martin Czygan | 2021-07-09 | 1 | -2/+3 |
* | reduce: use a base64 encoded title as key | Martin Czygan | 2021-07-09 | 1 | -1/+7 |
* | wiki: cleanup redundant check | Martin Czygan | 2021-07-09 | 1 | -1/+1 |
* | wiki: tweak whitespace handling | Martin Czygan | 2021-07-09 | 1 | -1/+7 |
* | wiki: more aggressive whitespace cleanup | Martin Czygan | 2021-07-09 | 1 | -1/+2 |
* | wiki: try a bit more cleanup | Martin Czygan | 2021-07-09 | 1 | -1/+5 |
* | wiki: verify doi | Martin Czygan | 2021-07-09 | 1 | -1/+1 |
* | unstructured: cleanup obsolete regex | Martin Czygan | 2021-07-09 | 1 | -9/+3 |
* | reduce: wiki doc in column 3 | Martin Czygan | 2021-07-09 | 1 | -1/+1 |
* | tests: sync verify test data | Martin Czygan | 2021-07-09 | 6 | -0/+176 |
* | wiki: flip doi and page title column | Martin Czygan | 2021-07-09 | 1 | -3/+3 |
* | reduce: move batch size | Martin Czygan | 2021-07-09 | 2 | -9/+9 |
* | reduce: prepare command line help | Martin Czygan | 2021-07-08 | 1 | -0/+12 |
* | update docs | Martin Czygan | 2021-07-08 | 1 | -3/+3 |
* | reduce: set default batch size | Martin Czygan | 2021-07-08 | 1 | -6/+8 |
* | simplify imports | Martin Czygan | 2021-07-08 | 9 | -9/+9 |