Commit message  Author  Age  Files  Lines
* switch to sligthly more performance string builder  Martin Czygan  2021-07-26  3  -43/+41
* reduce: mention upcoming change to indexing  Martin Czygan  2021-07-26  1  -1/+1
* Merge branch 'bnewbold-skate-tweaks' into 'master'  Martin Czygan  2021-07-26  9  -54/+116
|\
| * skate: use SanitizeDOI in all inputs  Bryan Newbold  2021-07-25  4  -22/+9
| * skate: fast SanitizeDOI helper for normalizing DOIs  Bryan Newbold  2021-07-25  2  -0/+71
| * skate unstructured: don't parse DOI out of key  Bryan Newbold  2021-07-25  1  -16/+0
| * skate: pass-through match_provenance in more situations  Bryan Newbold  2021-07-25  1  -0/+2
| * schema: switch from '.name' to '.raw_name' for un-parsed CSL name field  Bryan Newbold  2021-07-25  3  -6/+6
| * skate: use date-parts for year, not 'raw'  Bryan Newbold  2021-07-25  2  -8/+9
| * schema: have issued+accessed (CSLDate) actually omitempty  Bryan Newbold  2021-07-24  3  -5/+5
| * add test for issued,accessed not being included in output JSON  Bryan Newbold  2021-07-24  1  -0/+17
* | ci: show coverage  Martin Czygan  2021-07-26  1  -2/+1
* | add ci script  Martin Czygan  2021-07-26  1  -0/+6
|/
* tasks: simplify url list task  Martin Czygan  2021-07-23  1  -4/+1
* tasks: update docs  Martin Czygan  2021-07-23  1  -64/+15
* fix typo in ref schema  Martin Czygan  2021-07-23  1  -1/+1
* start mag notes  Martin Czygan  2021-07-22  1  -0/+21
* v0.1.4  Martin Czygan  2021-07-22  1  -1/+1
* update docs  Martin Czygan  2021-07-22  1  -4/+1
* update makefile  Martin Czygan  2021-07-22  1  -3/+0
* apply style fixes  Martin Czygan  2021-07-22  2  -0/+4
* update README  Martin Czygan  2021-07-22  5  -4/+39
* cli: show full path  Martin Czygan  2021-07-22  1  -1/+1
* cli: display TAG directory  Martin Czygan  2021-07-22  1  -0/+1
* add missing import  Martin Czygan  2021-07-22  1  -0/+1
* add luigi as dependency  Martin Czygan  2021-07-22  1  -0/+1
* remove reference to gluish  Martin Czygan  2021-07-22  1  -1/+1
* cleanup currently unused dependencies  Martin Czygan  2021-07-22  4  -18/+344
* v0.1.40  Martin Czygan  2021-07-22  1  -1/+1
* cleanup (old) clustering related code  Martin Czygan  2021-07-22  3  -177/+39
* minor doc fixes  Martin Czygan  2021-07-21  2  -4/+7
* xio: improve naming  Martin Czygan  2021-07-21  3  -33/+30
* reduce: use fixed length sha1 for url id part  Martin Czygan  2021-07-20  1  -3/+5
* tasks: increase default limit for cdx  Martin Czygan  2021-07-20  1  -1/+1
* reduce: fix wb id  Martin Czygan  2021-07-20  1  -1/+1
* reduce: a preliminary id for wb links  Martin Czygan  2021-07-20  1  -0/+5
* es indexing: update notes  Martin Czygan  2021-07-20  1  -1/+2
* reduce: temp fix 0 source release year  Martin Czygan  2021-07-19  1  -1/+4
* update notes on es indexing  Martin Czygan  2021-07-19  1  -0/+11
* cleanup another script  Martin Czygan  2021-07-17  5  -311/+72
* cleanup skate-bref-id  Martin Czygan  2021-07-17  2  -42/+1
* update indexing notes  Martin Czygan  2021-07-17  1  -0/+38
* tasks: add data point  Martin Czygan  2021-07-16  1  -2/+3
* reduce: use correct reducer  Martin Czygan  2021-07-15  1  -2/+2
* tasks: ignore exit code 141 for now  Martin Czygan  2021-07-15  1  -1/+1
* tasks: add BrefZipWayback  Martin Czygan  2021-07-15  1  -0/+20
* register reducer  Martin Czygan  2021-07-15  1  -0/+14
* add ZippyWayback reducer  Martin Czygan  2021-07-15  3  -54/+114
* tasks: reduce sample size  Martin Czygan  2021-07-15  1  -1/+1
* tasks: tweak CDXURL  Martin Czygan  2021-07-15  1  -2/+2