diff options
author | Martin Czygan <martin.czygan@gmail.com> | 2021-08-07 03:03:08 +0200 |
---|---|---|
committer | Martin Czygan <martin.czygan@gmail.com> | 2021-08-07 03:03:08 +0200 |
commit | fb76f7db65314bca50bda6fa9843240c4aacaea4 (patch) | |
tree | 1de0406e9429d50eb2e37a8afcd18e7515ff4878 /guide/src | |
parent | d4fe5c0419e8e6c8e7c13ac52c6085e6e4ddcb1b (diff) | |
download | fatcat-fb76f7db65314bca50bda6fa9843240c4aacaea4.tar.gz fatcat-fb76f7db65314bca50bda6fa9843240c4aacaea4.zip |
guide: reference graph, minor tweaks
Diffstat (limited to 'guide/src')
-rw-r--r-- | guide/src/reference_graph.md | 7 |
1 files changed, 4 insertions, 3 deletions
diff --git a/guide/src/reference_graph.md b/guide/src/reference_graph.md index 4f9f71dd..1470e124 100644 --- a/guide/src/reference_graph.md +++ b/guide/src/reference_graph.md @@ -119,8 +119,9 @@ only, and is missing many current references and citations. Additionally, only direct identifier lookups (eg, DOI matches) are used, not fuzzy metadata matching. -Open Library "target" matches are based on a snapshot of Open Library works, -and are matched either ISBN (extracted from citation string) or fuzzy metadata +Open Library "target" matches are based on a +[snapshot](https://openlibrary.org/developers/dumps) of Open Library works, and +are matched either ISBN (extracted from citation string) or fuzzy metadata matching. Crossref references are extracted from a January 2021 @@ -128,7 +129,7 @@ Crossref references are extracted from a January 2021 metadata, and do not include many updates to existing works. Hundreds of millions of raw citation strings ("unstructured") have not been -parsed into a structured for for fuzzy matching. We plan to use GROBID to parse +parsed into a structured for fuzzy matching. We plan to use GROBID to parse these citation strings, in addition to the current use of GROBID parsing for references from fulltext documents. |