| | Commit message | Author | Age | Files | Lines |
---|---|---|---|---|---|
* | note on approach | Martin Czygan | 2020-09-04 | 1 | -0/+10 |
* | docs: add another case | Martin Czygan | 2020-09-04 | 1 | -0/+8 |
* | readme: fix typo | Martin Czygan | 2020-09-04 | 1 | -1/+1 |
* | docs: another duplicate | Martin Czygan | 2020-09-03 | 1 | -0/+6 |
* | docs: exact example | Martin Czygan | 2020-09-03 | 1 | -0/+4 |
* | update README | Martin Czygan | 2020-09-03 | 1 | -0/+7 |
* | docs: another kind of duplication (granularity) | Martin Czygan | 2020-09-03 | 1 | -0/+9 |
* | example of almost same title | Martin Czygan | 2020-09-03 | 1 | -0/+8 |
* | docs: add more issue examples | Martin Czygan | 2020-09-03 | 1 | -0/+5 |
* | another example for common title | Martin Czygan | 2020-09-03 | 1 | -0/+4 |
* | docs: versions | Martin Czygan | 2020-09-03 | 1 | -0/+5 |
* | docs: ambiguous titles | Martin Czygan | 2020-09-03 | 1 | -0/+8 |
* | docs: another example of a long title | Martin Czygan | 2020-09-03 | 1 | -0/+6 |
* | docs: another quality issue | Martin Czygan | 2020-09-03 | 1 | -0/+7 |
* | docs: common title issue | Martin Czygan | 2020-09-03 | 1 | -0/+12 |
* | docs: add link to issue | Martin Czygan | 2020-09-03 | 1 | -0/+2 |
* | update various docs; start data issue log | Martin Czygan | 2020-09-03 | 5 | -2/+26 |
* | add example grobid output | Martin Czygan | 2020-08-27 | 1 | -0/+195 |
* | README: add performance data point | Martin Czygan | 2020-08-27 | 2 | -0/+22 |
* | update project README | Martin Czygan | 2020-08-27 | 4 | -0/+22 |
* | move datasets to projects | Martin Czygan | 2020-08-27 | 4 | -0/+10 |
* | update notes | Martin Czygan | 2020-08-25 | 1 | -3/+4 |
* | datasets: add samples item | Martin Czygan | 2020-08-25 | 2 | -1/+1 |
* | start datasets section | Martin Czygan | 2020-08-25 | 2 | -0/+16 |
| | Datasets to run fuzzy matching over, including a way to download all inputs, run with various parameters, etc. | | | | |
* | stub: command line | Martin Czygan | 2020-08-18 | 3 | -7/+18 |
* | serial name: no default path | Martin Czygan | 2020-08-17 | 1 | -1/+1 |
* | serial name: no default path | Martin Czygan | 2020-08-17 | 1 | -0/+2 |
* | ignore tmp | Martin Czygan | 2020-08-17 | 1 | -0/+1 |
* | matching: verify release match stub | Martin Czygan | 2020-08-17 | 1 | -2/+24 |
* | tests: add stub | Martin Czygan | 2020-08-17 | 1 | -0/+5 |
* | matching: verify container can verify serial name first | Martin Czygan | 2020-08-17 | 1 | -2/+7 |
* | add stub script | Martin Czygan | 2020-08-17 | 2 | -0/+9 |
* | matching: two stage verification | Martin Czygan | 2020-08-17 | 1 | -18/+29 |
* | large overhaul | Martin Czygan | 2020-08-17 | 14 | -234/+577 |
| | separate all fatcat related code into fatcat submodule; more type annotations; add verify_serial_name for journal names | | | | |
* | issn: simhash example | Martin Czygan | 2020-08-17 | 2 | -0/+20 |
* | add notes on abbrevs | Martin Czygan | 2020-08-15 | 3 | -1/+2261 |
* | include original and normalized name in default shelve (1G) | Martin Czygan | 2020-08-15 | 3 | -8/+16 |
* | separate cleanups | Martin Czygan | 2020-08-15 | 2 | -0/+47 |
* | cleanup handling: add parameter | Martin Czygan | 2020-08-15 | 4 | -19/+26 |
| | allow string cleanup to be called directly | | | | |
* | update static files | Martin Czygan | 2020-08-15 | 2 | -1/+3 |
* | add extra files | Martin Czygan | 2020-08-15 | 3 | -0/+17 |
* | try out shelve for name lookups | Martin Czygan | 2020-08-15 | 1 | -10/+62 |
| | uncompressed about 500 MB; marisa-trie would need extra encoding approach (plus it is a heavy dependency). | | | | |
* | update README | Martin Czygan | 2020-08-15 | 1 | -1/+5 |
* | issn: pair with issnl | Martin Czygan | 2020-08-14 | 1 | -19/+26 |
* | update plan | Martin Czygan | 2020-08-14 | 1 | -0/+5 |
* | add de-jsonld flag | Martin Czygan | 2020-08-14 | 1 | -15/+57 |
* | issn: jsonld breakup | Martin Czygan | 2020-08-13 | 1 | -25/+190 |
* | update journal name notebook | Martin Czygan | 2020-08-13 | 1 | -434/+442 |
* | update notebook | Martin Czygan | 2020-08-12 | 1 | -86/+729 |
* | update README | Martin Czygan | 2020-08-12 | 1 | -1/+3 |