aboutsummaryrefslogtreecommitdiffstats
Commit message (Collapse)AuthorAgeFilesLines
* note on approachMartin Czygan2020-09-041-0/+10
|
* docs: add another caseMartin Czygan2020-09-041-0/+8
|
* readme: fix typoMartin Czygan2020-09-041-1/+1
|
* docs: another duplicateMartin Czygan2020-09-031-0/+6
|
* docs: exact exampleMartin Czygan2020-09-031-0/+4
|
* update READMEMartin Czygan2020-09-031-0/+7
|
* docs: another kind of duplication (granularity)Martin Czygan2020-09-031-0/+9
|
* example of almost same titleMartin Czygan2020-09-031-0/+8
|
* docs: add more issue examplesMartin Czygan2020-09-031-0/+5
|
* another example for common titleMartin Czygan2020-09-031-0/+4
|
* docs: versionsMartin Czygan2020-09-031-0/+5
|
* docs: ambiguous titlesMartin Czygan2020-09-031-0/+8
|
* docs: another example of a long titleMartin Czygan2020-09-031-0/+6
|
* docs: another quality issueMartin Czygan2020-09-031-0/+7
|
* docs: common title issueMartin Czygan2020-09-031-0/+12
|
* docs: add link to issueMartin Czygan2020-09-031-0/+2
|
* update various docs; start data issue logMartin Czygan2020-09-035-2/+26
|
* add example grobid outputMartin Czygan2020-08-271-0/+195
|
* README: add performance data pointMartin Czygan2020-08-272-0/+22
|
* update project READMEMartin Czygan2020-08-274-0/+22
|
* move datasets to projectsMartin Czygan2020-08-274-0/+10
|
* update notesMartin Czygan2020-08-251-3/+4
|
* datasets: add samples itemMartin Czygan2020-08-252-1/+1
|
* start datasets sectionMartin Czygan2020-08-252-0/+16
| | | | | Datasets to run fuzzy matching over, including a way to download all inputs, run with various parameters, etc.
* stub: command lineMartin Czygan2020-08-183-7/+18
|
* serial name: no default pathMartin Czygan2020-08-171-1/+1
|
* serial name: no default pathMartin Czygan2020-08-171-0/+2
|
* ignore tmpMartin Czygan2020-08-171-0/+1
|
* matching: verify release match stubMartin Czygan2020-08-171-2/+24
|
* tests: add stubMartin Czygan2020-08-171-0/+5
|
* matching: verify container can verify serial name firstMartin Czygan2020-08-171-2/+7
|
* add stub scriptMartin Czygan2020-08-172-0/+9
|
* matching: two stage verificationMartin Czygan2020-08-171-18/+29
|
* large overhaulMartin Czygan2020-08-1714-234/+577
| | | | | | * separate all fatcat related code into fatcat submodule * more type annotations * add verify_serial_name for journal names
* issn: simhash exampleMartin Czygan2020-08-172-0/+20
|
* add notes on abbrevsMartin Czygan2020-08-153-1/+2261
|
* include original and normalized name in default shelve (1G)Martin Czygan2020-08-153-8/+16
|
* separate cleanupsMartin Czygan2020-08-152-0/+47
|
* cleanup handling: add parameterMartin Czygan2020-08-154-19/+26
| | | | allow string cleanup be called directly
* update static filesMartin Czygan2020-08-152-1/+3
|
* add extra filesMartin Czygan2020-08-153-0/+17
|
* try out shelve for name lookupsMartin Czygan2020-08-151-10/+62
| | | | | uncompressed about 500 MB; marisa-trie would need extra encoding approach (plus it is a heavy dependency).
* update READMEMartin Czygan2020-08-151-1/+5
|
* issn: pair with issnlMartin Czygan2020-08-141-19/+26
|
* update planMartin Czygan2020-08-141-0/+5
|
* add de-jsonld flagMartin Czygan2020-08-141-15/+57
|
* issn: jsonld breakupMartin Czygan2020-08-131-25/+190
|
* update journal name notebookMartin Czygan2020-08-131-434/+442
|
* update notebookMartin Czygan2020-08-121-86/+729
|
* update READMEMartin Czygan2020-08-121-1/+3
|