Commit message (Author, Age, Files, Lines)
* apply first round of feedback on matching [HEAD, master] (Martin Czygan, 2021-12-21, 13, -9/+73)
* matching: track_total_hits, use False (Martin Czygan, 2021-12-16, 1, -4/+4)
* matching: we do not need exact match counts (Martin Czygan, 2021-12-16, 1, -4/+4)
* matching: add hdl, remove mag id (Martin Czygan, 2021-12-16, 1, -1/+1)
* matching: cleanup and documentation (Martin Czygan, 2021-12-07, 1, -47/+29)
* matching: update docs (Martin Czygan, 2021-12-07, 1, -9/+8)
* v0.1.23 (Martin Czygan, 2021-12-06, 1, -1/+1)
* matching: cleanup test files (Martin Czygan, 2021-12-06, 24, -202/+1)
* complete FuzzyReleaseMatcher refactoring (Martin Czygan, 2021-12-06, 14, -362/+644)
* complete migration from away from match_release_fuzzy (Martin Czygan, 2021-11-16, 4, -247/+7)
* update todo (Martin Czygan, 2021-11-16, 1, -1/+2)
* Merge branch 'martin-matcher-class' into 'master' (Martin Czygan, 2021-11-16, 24, -112/+1410)
|\
| * use elasticsearch <7.14 search args (Martin Czygan, 2021-11-16, 1, -11/+47)
| * setup: add missing pyyaml dependency (Martin Czygan, 2021-11-16, 1, -0/+1)
| * setup: add thefuzz dependency (Martin Czygan, 2021-11-16, 1, -1/+2)
| * turn "match_release_fuzzy" into a class (Martin Czygan, 2021-11-16, 23, -111/+1371)
|/
* Merge branch 'bnewbold-grobid-tei-xml' into 'master' (Martin Czygan, 2021-11-04, 4, -277/+56)
|\
| * use grobid_tei_xml for grobid unstructured lookups (Bryan Newbold, 2021-10-28, 4, -277/+56)
|/
* Merge branch 'bnewbold-tweaks' into 'master' (Martin Czygan, 2021-10-28, 3, -3/+5)
|\
| * bump fatcat-openapi-client version to 0.4.0 (Bryan Newbold, 2021-10-27, 1, -1/+1)
| * matching: include contribs,files in release entity (Bryan Newbold, 2021-10-27, 1, -1/+1)
| * packaging: include py.typed for mypy to detect (Bryan Newbold, 2021-10-27, 2, -0/+1)
| * deps: pin elasticsearch to less than 7.14 (Bryan Newbold, 2021-10-27, 1, -1/+2)
|/
* start larger refactoring: remove cluster (Martin Czygan, 2021-09-24, 9, -723/+188)
* setup: narrow dependency versions (Martin Czygan, 2021-09-21, 1, -4/+4)
* Merge branch 'wip-martin-review-cleanup' into 'master' (Martin Czygan, 2021-09-21, 13, -20/+272)
|\
| * tests: temporarily disable tests (Martin Czygan, 2021-09-21, 1, -12/+12)
| * matching: run an additional es query for fuzzy matching (Martin Czygan, 2021-09-21, 2, -3/+93)
| * reorganize notes (Martin Czygan, 2021-09-21, 6, -2/+153)
| * style: apply formatting (Martin Czygan, 2021-09-21, 7, -11/+22)
|/
* matching: actually return the specified number of results (Martin Czygan, 2021-09-15, 1, -2/+2)
* add todo (Martin Czygan, 2021-09-14, 1, -0/+28)
* remove pipenv related files (Martin Czygan, 2021-09-13, 5, -979/+25)
* v0.1.22 (Martin Czygan, 2021-09-13, 1, -1/+1)
* cluster: adjust tests to jellyfish nysiis implementation (Martin Czygan, 2021-09-13, 1, -7/+7)
* update README (Martin Czygan, 2021-09-13, 1, -4/+7)
* remove dependency on fuzzy; use jellyfish (Martin Czygan, 2021-09-13, 4, -304/+286)
* cleanup makefile (Martin Czygan, 2021-09-13, 1, -2/+0)
* update mentions of cgraph to refcat (Bryan Newbold, 2021-09-10, 2, -2/+2)
* Merge branch 'master' of git.archive.org:webgroup/fuzzycat (Martin Czygan, 2021-07-09, 8, -224/+318)
|\
| * Merge branch 'bnewbold-readme' into 'master' (Martin Czygan, 2021-07-07, 2, -210/+245)
| |\
| | * simplify README for general audience; move some content to notes (Bryan Newbold, 2021-07-01, 2, -210/+245)
| * | Merge branch 'bnewbold-verify-improvements' into 'master' (Martin Czygan, 2021-07-02, 6, -14/+73)
| |\ \
| | |/
| |/|
| | * sandcrawler slugify: lower-case greek ambiguity (OCR) (Bryan Newbold, 2021-07-01, 1, -2/+13)
| | * DOI clean/normalize helper; and use in verification etc (Bryan Newbold, 2021-07-01, 5, -6/+35)
| | * verify: page count parsing and comparison improvements (Bryan Newbold, 2021-07-01, 3, -6/+25)
| |/
* | add a few (open) tests cases (Martin Czygan, 2021-07-09, 6, -0/+176)
* | notes on matching metrics (Martin Czygan, 2021-07-08, 1, -0/+16)
* | cleanup notes (Martin Czygan, 2021-07-08, 2, -13/+0)
|/
* add test case (Martin Czygan, 2021-06-21, 4, -0/+1339)