Commit message (Collapse) | Author | Age | Files | Lines | |
---|---|---|---|---|---|
* | rename module to gitlab.com/internetarchive/refcat | Martin Czygan | 2021-10-20 | 17 | -30/+30 |
| | | | | | This changes all the import paths to the current canonical location on http://gitlab.com/internetarchive/refcat. | ||||
* | makefile: go get is deprecated | Martin Czygan | 2021-09-27 | 1 | -1/+1 |
| | |||||
* | misc: fix and improve comments | Martin Czygan | 2021-09-23 | 4 | -5/+28 |
| | |||||
* | verify: fix link in log message | Martin Czygan | 2021-09-23 | 1 | -1/+1 |
| | |||||
* | ignore notes in progress | Martin Czygan | 2021-08-16 | 1 | -0/+1 |
| | |||||
* | map: add note on input data issue | Martin Czygan | 2021-08-05 | 1 | -0/+2 |
| | |||||
* | map: only include complete rows | Martin Czygan | 2021-08-05 | 1 | -0/+3 |
| | |||||
* | adjust schema tests | Martin Czygan | 2021-08-05 | 1 | -2/+2 |
| | |||||
* | tasks: use a mapper | Martin Czygan | 2021-08-05 | 3 | -0/+22 |
| | |||||
* | cleanup: add stub ref cleanup | Martin Czygan | 2021-08-05 | 2 | -1/+18 |
| | |||||
* | skate: move cleanup code out | Martin Czygan | 2021-08-05 | 4 | -62/+101 |
| | |||||
* | update docs | Martin Czygan | 2021-08-03 | 1 | -0/+2 |
| | |||||
* | update notes | Martin Czygan | 2021-08-02 | 1 | -0/+20 |
| | |||||
* | v0.2.4 | Martin Czygan | 2021-08-02 | 1 | -1/+1 |
| | |||||
* | mapper: add bref, rewo mappers | Martin Czygan | 2021-08-02 | 2 | -0/+21 |
| | |||||
* | xio: mention lock | Martin Czygan | 2021-07-31 | 1 | -1/+1 |
| | | | | | there are other lock-free implementations, such as https://github.com/cloudfoundry/go-diodes | ||||
* | v0.2.3 | Martin Czygan | 2021-07-28 | 1 | -1/+1 |
| | |||||
* | reduce: remove log statements | Martin Czygan | 2021-07-28 | 1 | -4/+0 |
| | |||||
* | reduce: adjust test | Martin Czygan | 2021-07-28 | 1 | -3/+3 |
| | |||||
* | reduce: fix test files | Martin Czygan | 2021-07-28 | 2 | -52/+104 |
| | |||||
* | adjust reduce test cases | Martin Czygan | 2021-07-28 | 4 | -1087/+1087 |
| | |||||
* | v0.2.2 | Martin Czygan | 2021-07-28 | 1 | -1/+1 |
| | |||||
* | leave ref.index unchanged | Martin Czygan | 2021-07-28 | 1 | -6/+6 |
| | | | | | | | | | previously, we started with 0-indexed input, but wanted 1-indexed values so we added increments at various points which probably lead to bug (missing refs), since at one point we would fuse the original ref data (w/o increments) with the matched data (w/ increments); with scholar:528804ad2e55983cf3e5e6659d8f46db0cab02b7 we can now leave indices as is | ||||
* | reduce: add case | Martin Czygan | 2021-07-28 | 5 | -0/+93 |
| | |||||
* | reduce: add more logging, temporarily | Martin Czygan | 2021-07-27 | 1 | -1/+6 |
| | |||||
* | add bref augment scaffold | Martin Czygan | 2021-07-27 | 4 | -0/+43 |
| | |||||
* | tests: use more sensible names | Martin Czygan | 2021-07-27 | 2 | -6/+6 |
| | |||||
* | move test utilities into testutil | Martin Czygan | 2021-07-27 | 2 | -32/+41 |
| | |||||
* | annotate test cases | Martin Czygan | 2021-07-27 | 1 | -5/+12 |
| | |||||
* | update docs | Martin Czygan | 2021-07-27 | 1 | -4/+1 |
| | |||||
* | update dependencies | Martin Czygan | 2021-07-27 | 2 | -6/+0 |
| | |||||
* | remove unused/partially implemented skate-dot for now | Martin Czygan | 2021-07-27 | 2 | -75/+1 |
| | |||||
* | minor tweaks and doc improvements | Martin Czygan | 2021-07-27 | 2 | -49/+45 |
| | |||||
* | update todo notes | Martin Czygan | 2021-07-27 | 1 | -2/+22 |
| | |||||
* | reuse timestamps | Martin Czygan | 2021-07-27 | 1 | -6/+14 |
| | | | | | | | | | | while time.Now is not really slow, thanks to vDSO (cf. https://git.io/J4SOH), it will be even faster to just call it once at the start of the processing; also: https://twitter.com/davidcrawshaw/status/1414243408936280073 > Turns out http://time.Now was taking its usual amount of time on linux, ~50 nanoseconds [...] | ||||
* | reduce: explicitly name magic numbers | Martin Czygan | 2021-07-27 | 1 | -3/+8 |
| | |||||
* | schema: add note regarding field name | Martin Czygan | 2021-07-27 | 1 | -1/+1 |
| | |||||
* | v0.2.1 | Martin Czygan | 2021-07-27 | 1 | -1/+1 |
| | |||||
* | schema: tweaks | Martin Czygan | 2021-07-27 | 2 | -5/+112 |
| | | | | add String() to CSLDate; we only cover a few typical cases | ||||
* | cleanup and docs | Martin Czygan | 2021-07-27 | 1 | -25/+4 |
| | |||||
* | reduce: use pascal case | Martin Czygan | 2021-07-26 | 1 | -2/+2 |
| | |||||
* | v0.2.0 | Martin Czygan | 2021-07-26 | 1 | -1/+1 |
| | | | | | | | | | | | | lots of tweaks * normalize (eg, lower-case) DOIs in all (or at least most?) situations, especially for equality comparisons * don't try to parse DOI from ref_key (which may contain a source DOI, but not a target DOI) * switch to using date-parts for year in target_csl output * switch from author.name to author.raw_name in target_csl output (neither are standard; raw_name indicates this better) * pass through match_provenance in unmatched case * in target_csl output, don't always include issued and accessed dates as * empty objects (could save significant ES index disk space?) | ||||
* | switch to sligthly more performance string builder | Martin Czygan | 2021-07-26 | 3 | -43/+41 |
| | |||||
* | reduce: mention upcoming change to indexing | Martin Czygan | 2021-07-26 | 1 | -1/+1 |
| | | | | see: scholar:528804ad2e55983cf3e5e6659d8f46db0cab02b7 | ||||
* | skate: use SanitizeDOI in all inputs | Bryan Newbold | 2021-07-25 | 4 | -22/+9 |
| | |||||
* | skate: fast SanitizeDOI helper for normalizing DOIs | Bryan Newbold | 2021-07-25 | 2 | -0/+71 |
| | |||||
* | skate unstructured: don't parse DOI out of key | Bryan Newbold | 2021-07-25 | 1 | -16/+0 |
| | | | | | | DOIs in keys, usually from Crossref, are the DOI of the *source* of the reference, not the *target* of the reference. Thus, they should not be parsed and copied to the ref.biblio.doi field. | ||||
* | skate: pass-through match_provenance in more situations | Bryan Newbold | 2021-07-25 | 1 | -0/+2 |
| | |||||
* | schema: switch from '.name' to '.raw_name' for un-parsed CSL name field | Bryan Newbold | 2021-07-25 | 3 | -6/+6 |
| | |||||
* | skate: use date-parts for year, not 'raw' | Bryan Newbold | 2021-07-25 | 2 | -8/+9 |
| |