aboutsummaryrefslogtreecommitdiffstats
diff options
context:
space:
mode:
-rw-r--r--python/notes/version_4.md38
1 files changed, 38 insertions, 0 deletions
diff --git a/python/notes/version_4.md b/python/notes/version_4.md
index e7be1e6..c7fcf40 100644
--- a/python/notes/version_4.md
+++ b/python/notes/version_4.md
@@ -891,3 +891,41 @@ igyewr6er5epfozhk7dyfqa5tu igyewr6er5epfozhk7dyfqa5tu exact doi
Completed 24/25 jobs in 42h with a few failures from disk space issues.
* 2562m34.844s
+
+----
+
+# Stats
+
+* BrefCombined
+
+```
+1,175,653,287 exact doi
+ 540,957,187 unmatched unknown
+ 62,621,153 strong jaccardauthors
+ 56,950,207 exact pmid
+ 11,942,348 strong slugtitleauthormatch
+ 10,278,122 strong tokenizedauthors
+ 3,419,958 exact arxiv
+ 2,479,491 exact titleauthormatch
+ 592,522 exact isbn
+ 462,252 strong versioneddoi
+ 91,076 strong pmiddoipair
+ 77,030 strong customieeearxiv
+ 57,290 strong dataciterelatedid
+ 33,837 strong arxivversion
+ 18,448 exact pmcid
+ 1,749 exact workid
+ 1,241 strong figshareversion
+ 557 strong titleartifact
+ 10 strong custombsiundated
+ 2 strong custombsisubdoc
+```
+
+Of the 540M unmatched, we have:
+
+```
+ Unstr CSL
+
+404285013 true false
+136672174 false true
+```