Diffstat (limited to 'docs')
-rw-r--r-- docs/TR-20210808100000-IA-WDS-REFCAT/main.tex | 4
1 files changed, 2 insertions, 2 deletions
diff --git a/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex b/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex
index 317311c..ea8e348 100644
--- a/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex
+++ b/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex
@@ -210,7 +210,7 @@ identifiers; for 1,303,424,212 - or 98.49\% of all citations - we do have a DOI
for both source and target). The majority of matches - 1,250,523,321 - is
established through identifier based matching (DOI, PMIC, PMCID, ARXIV, ISBN).
72,900,351 citations are established through fuzzy matching techniques, where
-references did not contain identifiers\footnote{This not necessary mean that
+references did not contain identifiers\footnote{This not necessary means that
the records in question do not have an identifier; however if an identifier
existed, it was not part of the raw reference.}.
Citations from the Open Citations' COCI corpus\footnote{Reference dataset COCI
@@ -419,7 +419,7 @@ indexed into a search index and serves both matched and unmatched references
for the web application, allowing for further collection of feedback on match
quality and possible improvements.
-With a few schema conversions, fuzzy matching has been be applied to Wikipedia
+With a few schema conversions, fuzzy matching has been applied to Wikipedia
articles and Open Library (edition) records as well. The aspect of precision
and recall are represented by the two stages: we are generous in the match
candidate generation phase in order to improve recall, but we are strict during