Commit message | Author | Date | Files | Lines | ||
---|---|---|---|---|---|---|
... | ||||||
* | | add ci script | Martin Czygan | 2021-07-26 | 1 | -0/+6 | |
|/ | ||||||
* | tasks: simplify url list task | Martin Czygan | 2021-07-23 | 1 | -4/+1 | |
| | ||||||
* | tasks: update docs | Martin Czygan | 2021-07-23 | 1 | -64/+15 | |
| | ||||||
* | fix typo in ref schema | Martin Czygan | 2021-07-23 | 1 | -1/+1 | |
| | ||||||
* | start mag notes | Martin Czygan | 2021-07-22 | 1 | -0/+21 | |
| | ||||||
* | v0.1.4 | Martin Czygan | 2021-07-22 | 1 | -1/+1 | |
| | ||||||
* | update docs | Martin Czygan | 2021-07-22 | 1 | -4/+1 | |
| | ||||||
* | update makefile | Martin Czygan | 2021-07-22 | 1 | -3/+0 | |
| | ||||||
* | apply style fixes | Martin Czygan | 2021-07-22 | 2 | -0/+4 | |
| | ||||||
* | update README | Martin Czygan | 2021-07-22 | 5 | -4/+39 | |
| | ||||||
* | cli: show full path | Martin Czygan | 2021-07-22 | 1 | -1/+1 | |
| | ||||||
* | cli: display TAG directory | Martin Czygan | 2021-07-22 | 1 | -0/+1 | |
| | ||||||
* | add missing import | Martin Czygan | 2021-07-22 | 1 | -0/+1 | |
| | ||||||
* | add luigi as dependency | Martin Czygan | 2021-07-22 | 1 | -0/+1 | |
| | ||||||
* | remove reference to gluish | Martin Czygan | 2021-07-22 | 1 | -1/+1 | |
| | ||||||
* | cleanup currently unused dependencies | Martin Czygan | 2021-07-22 | 4 | -18/+344 | |
| | | | | code from gluish copied into base.py | |||||
* | v0.1.40 | Martin Czygan | 2021-07-22 | 1 | -1/+1 | |
| | ||||||
* | cleanup (old) clustering related code | Martin Czygan | 2021-07-22 | 3 | -177/+39 | |
| | ||||||
* | minor doc fixes | Martin Czygan | 2021-07-21 | 2 | -4/+7 | |
| | ||||||
* | xio: improve naming | Martin Czygan | 2021-07-21 | 3 | -33/+30 | |
| | ||||||
* | reduce: use fixed length sha1 for url id part | Martin Czygan | 2021-07-20 | 1 | -3/+5 | |
| | | | | | base32 would occasionally exceed elasticsearch id field limit ("must be no longer than 512 bytes but was: 649") | |||||
* | tasks: increase default limit for cdx | Martin Czygan | 2021-07-20 | 1 | -1/+1 | |
| | ||||||
* | reduce: fix wb id | Martin Czygan | 2021-07-20 | 1 | -1/+1 | |
| | ||||||
* | reduce: a preliminary id for wb links | Martin Czygan | 2021-07-20 | 1 | -0/+5 | |
| | ||||||
* | es indexing: update notes | Martin Czygan | 2021-07-20 | 1 | -1/+2 | |
| | ||||||
* | reduce: temp fix 0 source release year | Martin Czygan | 2021-07-19 | 1 | -1/+4 | |
| | ||||||
* | update notes on es indexing | Martin Czygan | 2021-07-19 | 1 | -0/+11 | |
| | ||||||
* | cleanup another script | Martin Czygan | 2021-07-17 | 5 | -311/+72 | |
| | ||||||
* | cleanup skate-bref-id | Martin Czygan | 2021-07-17 | 2 | -42/+1 | |
| | ||||||
* | update indexing notes | Martin Czygan | 2021-07-17 | 1 | -0/+38 | |
| | ||||||
* | tasks: add data point | Martin Czygan | 2021-07-16 | 1 | -2/+3 | |
| | ||||||
* | reduce: use correct reducer | Martin Czygan | 2021-07-15 | 1 | -2/+2 | |
| | ||||||
* | tasks: ignore exit code 141 for now | Martin Czygan | 2021-07-15 | 1 | -1/+1 | |
| | ||||||
* | tasks: add BrefZipWayback | Martin Czygan | 2021-07-15 | 1 | -0/+20 | |
| | ||||||
* | register reducer | Martin Czygan | 2021-07-15 | 1 | -0/+14 | |
| | ||||||
* | add ZippyWayback reducer | Martin Czygan | 2021-07-15 | 3 | -54/+114 | |
| | ||||||
* | tasks: reduce sample size | Martin Czygan | 2021-07-15 | 1 | -1/+1 | |
| | ||||||
* | tasks: tweak CDXURL | Martin Czygan | 2021-07-15 | 1 | -2/+2 | |
| | ||||||
* | tasks: fix command | Martin Czygan | 2021-07-15 | 1 | -0/+1 | |
| | ||||||
* | tasks: tweak CDXURL | Martin Czygan | 2021-07-15 | 1 | -3/+5 | |
| | ||||||
* | tasks: add CDXURL | Martin Czygan | 2021-07-15 | 1 | -0/+28 | |
| | ||||||
* | mapper: add cdxu | Martin Czygan | 2021-07-15 | 2 | -0/+22 | |
| | ||||||
* | tasks: cleanup urls | Martin Czygan | 2021-07-15 | 1 | -0/+1 | |
| | ||||||
* | notes: add unique example | Martin Czygan | 2021-07-15 | 1 | -1/+1 | |
| | ||||||
* | tasks: add RefsURL | Martin Czygan | 2021-07-15 | 1 | -0/+26 | |
| | ||||||
* | map: add another mapper | Martin Czygan | 2021-07-15 | 2 | -3/+17 | |
| | ||||||
* | cdx reshape: only include hits | Martin Czygan | 2021-07-15 | 1 | -2/+1 | |
| | ||||||
* | cdx reshape: write json | Martin Czygan | 2021-07-15 | 1 | -2/+2 | |
| | ||||||
* | extra: cdx reshape | Martin Czygan | 2021-07-15 | 1 | -0/+19 | |
| | ||||||
* | update notes | Martin Czygan | 2021-07-15 | 1 | -0/+9 | |
| |
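
The 2021-07-20 entry "reduce: use fixed length sha1 for url id part" notes that a base32-encoded URL could exceed the Elasticsearch id limit of 512 bytes. A minimal sketch of why a fixed-length digest avoids this is shown below. It is an illustration only, not the project's actual code: the function names, the `prefix` argument, and the `_` separator are hypothetical, and the real reducer may build ids differently.

```python
import base64
import hashlib

# Limit taken from the error message quoted in the log above.
ES_ID_LIMIT = 512  # bytes


def base32_id(prefix: str, url: str) -> str:
    """Hypothetical id scheme: prefix plus the base32-encoded URL.

    The encoded part grows with the URL (about 8/5 of its byte length),
    so long URLs can push the id past ES_ID_LIMIT.
    """
    encoded = base64.b32encode(url.encode("utf-8")).decode("ascii")
    return f"{prefix}_{encoded.rstrip('=').lower()}"


def sha1_id(prefix: str, url: str) -> str:
    """Hypothetical id scheme: prefix plus the SHA-1 hex digest of the URL.

    The digest is always 40 hex characters, so the id length does not
    depend on the URL length.
    """
    digest = hashlib.sha1(url.encode("utf-8")).hexdigest()
    return f"{prefix}_{digest}"


if __name__ == "__main__":
    long_url = "https://example.com/" + "a" * 400
    for make_id in (base32_id, sha1_id):
        doc_id = make_id("ref", long_url)
        size = len(doc_id.encode("utf-8"))
        print(make_id.__name__, size, "ok" if size <= ES_ID_LIMIT else "too long")
```

The trade-off in this sketch is that the digest is not reversible: the URL can no longer be recovered from the id, so it would have to be kept in a separate field if needed.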