author | Martin Czygan <martin.czygan@gmail.com> | 2021-10-14 20:48:09 +0200 |
---|---|---|
committer | Martin Czygan <martin.czygan@gmail.com> | 2021-10-14 20:48:09 +0200 |
commit | f641822523d905fa454849ce7620ed3f54213604 (patch) | |
tree | 9b61d52b3c28b9ba3043e119f4e593c0b9d50379 /docs/TR-20210808100000-IA-WDS-REFCAT | |
parent | 7d562495ab1b57268f7aad3825656f07e5498fe0 (diff) | |
docs: fix two typos in paper
Diffstat (limited to 'docs/TR-20210808100000-IA-WDS-REFCAT')
-rw-r--r-- | docs/TR-20210808100000-IA-WDS-REFCAT/main.tex | 4 |
1 files changed, 2 insertions, 2 deletions
diff --git a/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex b/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex
index 317311c..ea8e348 100644
--- a/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex
+++ b/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex
@@ -210,7 +210,7 @@ identifiers; for 1,303,424,212 - or 98.49\% of all citations - we do have a DOI
 for both source and target). The majority of matches - 1,250,523,321 - is
 established through identifier based matching (DOI, PMIC, PMCID, ARXIV, ISBN).
 72,900,351 citations are established through fuzzy matching techniques, where
-references did not contain identifiers\footnote{This not necessary mean that
+references did not contain identifiers\footnote{This not necessary means that
 the records in question do not have an identifier; however if an identifier
 existed, it was not part of the raw reference.}. Citations from the Open
 Citations' COCI corpus\footnote{Reference dataset COCI
@@ -419,7 +419,7 @@ indexed into a search index and serves both matched and unmatched references
 for the web application, allowing for further collection of feedback on match
 quality and possible improvements.
 
-With a few schema conversions, fuzzy matching has been be applied to Wikipedia
+With a few schema conversions, fuzzy matching has been applied to Wikipedia
 articles and Open Library (edition) records as well. The aspect of precision and
 recall are represented by the two stages: we are generous in the match candidate
 generation phase in order to improve recall, but we are strict during