From fb76f7db65314bca50bda6fa9843240c4aacaea4 Mon Sep 17 00:00:00 2001 From: Martin Czygan Date: Sat, 7 Aug 2021 03:03:08 +0200 Subject: guide: reference graph, minor tweaks --- guide/src/reference_graph.md | 7 ++++--- 1 file changed, 4 insertions(+), 3 deletions(-) diff --git a/guide/src/reference_graph.md b/guide/src/reference_graph.md index 4f9f71dd..1470e124 100644 --- a/guide/src/reference_graph.md +++ b/guide/src/reference_graph.md @@ -119,8 +119,9 @@ only, and is missing many current references and citations. Additionally, only direct identifier lookups (eg, DOI matches) are used, not fuzzy metadata matching. -Open Library "target" matches are based on a snapshot of Open Library works, -and are matched either ISBN (extracted from citation string) or fuzzy metadata +Open Library "target" matches are based on a +[snapshot](https://openlibrary.org/developers/dumps) of Open Library works, and +are matched either ISBN (extracted from citation string) or fuzzy metadata matching. Crossref references are extracted from a January 2021 @@ -128,7 +129,7 @@ Crossref references are extracted from a January 2021 metadata, and do not include many updates to existing works. Hundreds of millions of raw citation strings ("unstructured") have not been -parsed into a structured for for fuzzy matching. We plan to use GROBID to parse +parsed into a structured for fuzzy matching. We plan to use GROBID to parse these citation strings, in addition to the current use of GROBID parsing for references from fulltext documents. -- cgit v1.2.3