From f641822523d905fa454849ce7620ed3f54213604 Mon Sep 17 00:00:00 2001 From: Martin Czygan Date: Thu, 14 Oct 2021 20:48:09 +0200 Subject: docs: fix two typos in paper --- docs/TR-20210808100000-IA-WDS-REFCAT/main.tex | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) (limited to 'docs') diff --git a/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex b/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex index 317311c..ea8e348 100644 --- a/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex +++ b/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex @@ -210,7 +210,7 @@ identifiers; for 1,303,424,212 - or 98.49\% of all citations - we do have a DOI for both source and target). The majority of matches - 1,250,523,321 - is established through identifier based matching (DOI, PMIC, PMCID, ARXIV, ISBN). 72,900,351 citations are established through fuzzy matching techniques, where -references did not contain identifiers\footnote{This not necessary mean that +references did not contain identifiers\footnote{This not necessary means that the records in question do not have an identifier; however if an identifier existed, it was not part of the raw reference.}. Citations from the Open Citations' COCI corpus\footnote{Reference dataset COCI @@ -419,7 +419,7 @@ indexed into a search index and serves both matched and unmatched references for the web application, allowing for further collection of feedback on match quality and possible improvements. -With a few schema conversions, fuzzy matching has been be applied to Wikipedia +With a few schema conversions, fuzzy matching has been applied to Wikipedia articles and Open Library (edition) records as well. The aspect of precision and recall are represented by the two stages: we are generous in the match candidate generation phase in order to improve recall, but we are strict during -- cgit v1.2.3