author | bnewbold <bnewbold@archive.org> | 2021-08-09 17:04:53 +0000 |
---|---|---|
committer | bnewbold <bnewbold@archive.org> | 2021-08-09 17:04:53 +0000 |
commit | 91bbd3056a159afe1511b26d230862d053ab96be (patch) | |
tree | 1de0406e9429d50eb2e37a8afcd18e7515ff4878 /guide/src | |
parent | d4fe5c0419e8e6c8e7c13ac52c6085e6e4ddcb1b (diff) | |
parent | fb76f7db65314bca50bda6fa9843240c4aacaea4 (diff) | |
Merge branch 'martin-guide-ref-minor-tweaks' into 'master'
guide: reference graph, minor tweaks
See merge request webgroup/fatcat!115
Diffstat (limited to 'guide/src')
-rw-r--r-- | guide/src/reference_graph.md | 7 |
1 files changed, 4 insertions, 3 deletions
diff --git a/guide/src/reference_graph.md b/guide/src/reference_graph.md
index 4f9f71dd..1470e124 100644
--- a/guide/src/reference_graph.md
+++ b/guide/src/reference_graph.md
@@ -119,8 +119,9 @@ only, and is missing many current references and citations.
 Additionally, only direct identifier lookups (eg, DOI matches) are used, not
 fuzzy metadata matching.
 
-Open Library "target" matches are based on a snapshot of Open Library works,
-and are matched either ISBN (extracted from citation string) or fuzzy metadata
+Open Library "target" matches are based on a
+[snapshot](https://openlibrary.org/developers/dumps) of Open Library works, and
+are matched either ISBN (extracted from citation string) or fuzzy metadata
 matching.
 
 Crossref references are extracted from a January 2021
@@ -128,7 +129,7 @@ Crossref references are extracted from a January 2021
 metadata, and do not include many updates to existing works.
 
 Hundreds of millions of raw citation strings ("unstructured") have not been
-parsed into a structured for for fuzzy matching. We plan to use GROBID to parse
+parsed into a structured for fuzzy matching. We plan to use GROBID to parse
 these citation strings, in addition to the current use of GROBID parsing for
 references from fulltext documents.