diff options
author | Martin Czygan <martin.czygan@gmail.com> | 2021-06-28 20:03:58 +0200 |
---|---|---|
committer | Martin Czygan <martin.czygan@gmail.com> | 2021-06-28 20:03:58 +0200 |
commit | eb71aa4b05c1e02d2e125b9a5d16adc23ee71560 (patch) | |
tree | 62b9af82d44fed32b2760868a9b2fb2aa3459891 | |
parent | f4442f600b3f66704063ac91ce2769fa250751c9 (diff) | |
download | refcat-eb71aa4b05c1e02d2e125b9a5d16adc23ee71560.tar.gz refcat-eb71aa4b05c1e02d2e125b9a5d16adc23ee71560.zip |
notes: add some numbers
-rw-r--r-- | python/notes/version_4.md | 8 |
1 files changed, 4 insertions, 4 deletions
diff --git a/python/notes/version_4.md b/python/notes/version_4.md index 97811e7..4d8e9e3 100644 --- a/python/notes/version_4.md +++ b/python/notes/version_4.md @@ -879,7 +879,7 @@ igyewr6er5epfozhk7dyfqa5tu igyewr6er5epfozhk7dyfqa5tu exact doi * total unique edges: 740248530 * matches by id: 623707690 * matches though title/author (fuzzy) matching: 116540840 -* scholarly resources: -* linked open library titles: -* URLs extracted from corpus: -* sample ratio IA/URL from corpus: +* scholarly resources: 727853720 +* linked open library titles: 12394810 +* URLs extracted from corpus: 25405592 +* sample ratio IA/URL from corpus (N=100000): |