aboutsummaryrefslogtreecommitdiffstats
diff options
context:
space:
mode:
authorMartin Czygan <martin.czygan@gmail.com>2021-07-14 21:36:18 +0200
committerMartin Czygan <martin.czygan@gmail.com>2021-07-14 21:36:18 +0200
commit26235bca235c9785752856ab7ebbdf7bf8806be8 (patch)
treef8161a375d5a46d83ebf5f26c4701944f568bd14
parent2f27dcc92bf40872ceedfbb1a9709c01d2383f5e (diff)
downloadrefcat-26235bca235c9785752856ab7ebbdf7bf8806be8.tar.gz
refcat-26235bca235c9785752856ab7ebbdf7bf8806be8.zip
notes: 2021-07-06 version
-rw-r--r--python/notes/version_4.md38
1 files changed, 38 insertions, 0 deletions
diff --git a/python/notes/version_4.md b/python/notes/version_4.md
index e7be1e6..c7fcf40 100644
--- a/python/notes/version_4.md
+++ b/python/notes/version_4.md
@@ -891,3 +891,41 @@ igyewr6er5epfozhk7dyfqa5tu igyewr6er5epfozhk7dyfqa5tu exact doi
Completed 24/25 jobs in 42h with a few failures from disk space issues.
* 2562m34.844s
+
+----
+
+# Stats
+
+* BrefCombined
+
+```
+1,175,653,287 exact doi
+ 540,957,187 unmatched unknown
+ 62,621,153 strong jaccardauthors
+ 56,950,207 exact pmid
+ 11,942,348 strong slugtitleauthormatch
+ 10,278,122 strong tokenizedauthors
+ 3,419,958 exact arxiv
+ 2,479,491 exact titleauthormatch
+ 592,522 exact isbn
+ 462,252 strong versioneddoi
+ 91,076 strong pmiddoipair
+ 77,030 strong customieeearxiv
+ 57,290 strong dataciterelatedid
+ 33,837 strong arxivversion
+ 18,448 exact pmcid
+ 1,749 exact workid
+ 1,241 strong figshareversion
+ 557 strong titleartifact
+ 10 strong custombsiundated
+ 2 strong custombsisubdoc
+```
+
+Of the 540M unmatched, we have:
+
+```
+ Unstr CSL
+
+404285013 true false
+136672174 false true
+```