path: root/skate
Commit message | Author | Age | Files | Lines
* skate: use SanitizeDOI in all inputs | Bryan Newbold | 2021-07-25 | 4 | -22/+9
* skate: fast SanitizeDOI helper for normalizing DOIs | Bryan Newbold | 2021-07-25 | 2 | -0/+71
* skate unstructured: don't parse DOI out of key | Bryan Newbold | 2021-07-25 | 1 | -16/+0
* skate: pass-through match_provenance in more situations | Bryan Newbold | 2021-07-25 | 1 | -0/+2
* schema: switch from '.name' to '.raw_name' for un-parsed CSL name field | Bryan Newbold | 2021-07-25 | 3 | -6/+6
* skate: use date-parts for year, not 'raw' | Bryan Newbold | 2021-07-25 | 2 | -8/+9
* schema: have issued+accessed (CSLDate) actually omitempty | Bryan Newbold | 2021-07-24 | 3 | -5/+5
* add test for issued,accessed not being included in output JSON | Bryan Newbold | 2021-07-24 | 1 | -0/+17
* fix typo in ref schema | Martin Czygan | 2021-07-23 | 1 | -1/+1
* v0.1.40 | Martin Czygan | 2021-07-22 | 1 | -1/+1
* cleanup (old) clustering related code | Martin Czygan | 2021-07-22 | 3 | -177/+39
* minor doc fixes | Martin Czygan | 2021-07-21 | 2 | -4/+7
* xio: improve naming | Martin Czygan | 2021-07-21 | 3 | -33/+30
* reduce: use fixed length sha1 for url id part | Martin Czygan | 2021-07-20 | 1 | -3/+5
* reduce: fix wb id | Martin Czygan | 2021-07-20 | 1 | -1/+1
* reduce: a preliminary id for wb links | Martin Czygan | 2021-07-20 | 1 | -0/+5
* reduce: temp fix 0 source release year | Martin Czygan | 2021-07-19 | 1 | -1/+4
* cleanup another script | Martin Czygan | 2021-07-17 | 5 | -311/+72
* cleanup skate-bref-id | Martin Czygan | 2021-07-17 | 2 | -42/+1
* reduce: use correct reducer | Martin Czygan | 2021-07-15 | 1 | -2/+2
* register reducer | Martin Czygan | 2021-07-15 | 1 | -0/+14
* add ZippyWayback reducer | Martin Czygan | 2021-07-15 | 3 | -54/+114
* mapper: add cdxu | Martin Czygan | 2021-07-15 | 2 | -0/+22
* map: add another mapper | Martin Czygan | 2021-07-15 | 2 | -3/+17
* update docs | Martin Czygan | 2021-07-14 | 2 | -11/+11
* reduce: add test | Martin Czygan | 2021-07-14 | 2 | -18/+41
* reduce: add todo | Martin Czygan | 2021-07-14 | 1 | -0/+2
* v0.1.39 | Martin Czygan | 2021-07-14 | 1 | -1/+1
* reduce: add csl field | Martin Czygan | 2021-07-14 | 4 | -8/+72
* reduce: fix off-by-one error | Martin Czygan | 2021-07-14 | 2 | -2/+2
* reduce: temp bug fix for line cutter | Martin Czygan | 2021-07-13 | 2 | -32/+61
* v0.1.38 | Martin Czygan | 2021-07-13 | 1 | -1/+1
* reduce: small tweaks | Martin Czygan | 2021-07-13 | 2 | -6/+7
* fix typo | Martin Czygan | 2021-07-13 | 1 | -1/+1
* wip: csl logging | Martin Czygan | 2021-07-13 | 1 | -1/+1
* update docs | Martin Czygan | 2021-07-13 | 1 | -1/+7
* reduce/schema: add csl | Martin Czygan | 2021-07-13 | 3 | -5/+70
* wiki: include lang in encoded page title | Martin Czygan | 2021-07-13 | 2 | -8/+18
* reduce: add todo | Martin Czygan | 2021-07-13 | 1 | -1/+3
* separate slugify functions | Martin Czygan | 2021-07-13 | 4 | -28/+39
* mock out time.Now for tests | Martin Czygan | 2021-07-13 | 4 | -1034/+1041
* reduce: log broken line only | Martin Czygan | 2021-07-10 | 1 | -1/+1
* reduce: add key and indexed ts for exact matches | Martin Czygan | 2021-07-10 | 1 | -0/+2
* batch: drop logging | Martin Czygan | 2021-07-10 | 1 | -4/+0
* batch: log batch size | Martin Czygan | 2021-07-10 | 1 | -1/+1
* reduce: short circuit large groups | Martin Czygan | 2021-07-10 | 1 | -2/+12
* schema: prefer isbn13 | Martin Czygan | 2021-07-10 | 1 | -1/+5
* schema: render isbn as well | Martin Czygan | 2021-07-10 | 1 | -1/+7
* reduce: ol, fuzzy, w/ unstructured | Martin Czygan | 2021-07-10 | 1 | -1/+1
* schema: add test | Martin Czygan | 2021-07-10 | 2 | -0/+20