aboutsummaryrefslogtreecommitdiffstats
Commit message (Expand)AuthorAgeFilesLines
* use grobid_tei_xml for grobid unstructured lookupsBryan Newbold2021-10-284-277/+56
* Merge branch 'bnewbold-tweaks' into 'master'Martin Czygan2021-10-283-3/+5
|\
| * bump fatcat-openapi-client version to 0.4.0Bryan Newbold2021-10-271-1/+1
| * matching: include contribs,files in release entityBryan Newbold2021-10-271-1/+1
| * packaging: include py.typed for mypy to detectBryan Newbold2021-10-272-0/+1
| * deps: pin elasticsearch to less than 7.14Bryan Newbold2021-10-271-1/+2
|/
* start larger refactoring: remove clusterMartin Czygan2021-09-249-723/+188
* setup: narrow dependency versionsMartin Czygan2021-09-211-4/+4
* Merge branch 'wip-martin-review-cleanup' into 'master'Martin Czygan2021-09-2113-20/+272
|\
| * tests: temporarily disable testsMartin Czygan2021-09-211-12/+12
| * matching: run an additional es query for fuzzy matchingMartin Czygan2021-09-212-3/+93
| * reorganize notesMartin Czygan2021-09-216-2/+153
| * style: apply formattingMartin Czygan2021-09-217-11/+22
|/
* matching: actually return the specified number of resultsMartin Czygan2021-09-151-2/+2
* add todoMartin Czygan2021-09-141-0/+28
* remove pipenv related filesMartin Czygan2021-09-135-979/+25
* v0.1.22Martin Czygan2021-09-131-1/+1
* cluster: adjust tests to jellyfish nysiis implementationMartin Czygan2021-09-131-7/+7
* update READMEMartin Czygan2021-09-131-4/+7
* remove dependency on fuzzy; use jellyfishMartin Czygan2021-09-134-304/+286
* cleanup makefileMartin Czygan2021-09-131-2/+0
* update mentions of cgraph to refcatBryan Newbold2021-09-102-2/+2
* Merge branch 'master' of git.archive.org:webgroup/fuzzycatMartin Czygan2021-07-098-224/+318
|\
| * Merge branch 'bnewbold-readme' into 'master'Martin Czygan2021-07-072-210/+245
| |\
| | * simplify README for general audience; move some content to notesBryan Newbold2021-07-012-210/+245
| * | Merge branch 'bnewbold-verify-improvements' into 'master'Martin Czygan2021-07-026-14/+73
| |\ \ | | |/ | |/|
| | * sandcrawler slugify: lower-case greek ambiguity (OCR)Bryan Newbold2021-07-011-2/+13
| | * DOI clean/normalize helper; and use in verification etcBryan Newbold2021-07-015-6/+35
| | * verify: page count parsing and comparison improvementsBryan Newbold2021-07-013-6/+25
| |/
* | add a few (open) tests casesMartin Czygan2021-07-096-0/+176
* | notes on matching metricsMartin Czygan2021-07-081-0/+16
* | cleanup notesMartin Czygan2021-07-082-13/+0
|/
* add test caseMartin Czygan2021-06-214-0/+1339
* v0.1.21Martin Czygan2021-06-011-1/+1
* Merge branch 'bnewbold-bugfixes' into 'master'Martin Czygan2021-06-019-86/+110
|\
| * lint: remove unused importsBryan Newbold2021-05-317-10/+1
| * rebuild Pipefile.lock, for 'fuzzy' depBryan Newbold2021-05-311-75/+101
| * setup.py: express dynaconf dependencyBryan Newbold2021-05-311-0/+1
| * matching: handle extid not found case (fatcat API HTTP 400 or 404)Bryan Newbold2021-05-311-1/+7
|/
* add test caseMartin Czygan2021-05-263-0/+83
* add testMartin Czygan2021-05-123-0/+603
* add test casesMartin Czygan2021-05-0610-0/+1861
* add test caseMartin Czygan2021-04-203-0/+107
* ignore pyproject.tomlMartin Czygan2021-04-171-0/+3
* update lock fileMartin Czygan2021-04-171-156/+184
* add testMartin Czygan2021-04-173-0/+1982
* v0.1.20Martin Czygan2021-04-151-1/+1
* addess #2Martin Czygan2021-04-152-0/+4
* Merge branch 'bnewbold-upstreaming' into 'master'Martin Czygan2021-04-156-1/+823
|\
| * main: 'unstructured' CLI demoBryan Newbold2021-04-141-1/+38