aboutsummaryrefslogtreecommitdiffstats
diff options
context:
space:
mode:
-rw-r--r--docs/TR-20210808100000-IA-WDS-REFCAT/main.pdfbin95918 -> 95948 bytes
-rw-r--r--docs/TR-20210808100000-IA-WDS-REFCAT/main.tex9
2 files changed, 5 insertions, 4 deletions
diff --git a/docs/TR-20210808100000-IA-WDS-REFCAT/main.pdf b/docs/TR-20210808100000-IA-WDS-REFCAT/main.pdf
index 46ccca7..75a5449 100644
--- a/docs/TR-20210808100000-IA-WDS-REFCAT/main.pdf
+++ b/docs/TR-20210808100000-IA-WDS-REFCAT/main.pdf
Binary files differ
diff --git a/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex b/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex
index bbf9fa9..76f1456 100644
--- a/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex
+++ b/docs/TR-20210808100000-IA-WDS-REFCAT/main.tex
@@ -49,9 +49,10 @@
Wikipedia\footnote{\url{https://wikipedia.org}}. This first version of the
graph consists of 1,323,423,672 citations. We release this dataset under a CC0
Public Domain Dedication, accessible through an archive
- item\footnote{\url{https://archive.org/details/refcat_2021-07-28}}. All
- code used in the derivation process is released under an MIT
- license\footnote{\url{https://gitlab.com/internetarchive/cgraph}}.
+ item\footnote{\url{https://archive.org/details/refcat_2021-07-28}}.
+ The source code used for the derivation process, including exact and fuzzy
+ citation matching, is released under an MIT
+ license\footnote{\url{https://gitlab.com/internetarchive/refcat}}.
\end{abstract}
\keywords{Citation Graph, Web Archiving}
@@ -241,7 +242,7 @@ candidate generation phase in order to improve recall, but we are strict during
verification, in order to control precision. Quality assurance for verification is
implemented through a growing list of test cases of real examples from the catalog and
their expected or desired match status\footnote{The list can be found under:
- \url{https://gitlab.com/internetarchive/cgraph/-/blob/master/skate/testdata/verify.csv}.
+ \url{https://gitlab.com/internetarchive/refcat/-/blob/master/skate/testdata/verify.csv}.
It is helpful to keep this test suite independent of any specific programming language.}.