Diffstat (limited to 'notes/doaj_graph.md')
-rw-r--r-- notes/doaj_graph.md | 20
1 files changed, 0 insertions, 20 deletions
diff --git a/notes/doaj_graph.md b/notes/doaj_graph.md
deleted file mode 100644
index 449220b..0000000
--- a/notes/doaj_graph.md
+++ /dev/null
@@ -1,20 +0,0 @@
-# DOAJ Citation Graph
-
-This dataset contains the subset of edges of the Internet Archive (IA)
-Scholar Citation Graph (refcat, v1, 2021-07-28) in which the citing work, the
-cited work, or both are part of DOAJ.
-
-Basic numbers:
-
-* DOAJ DOIs used for matching edges: 4,886,099
-* Catalog entries via DOI in fatcat: 4,773,245
-* We find 124,760,397 edges. Of these, 98,616,033 have a source in DOAJ,
- 34,910,769 have a DOAJ article as target, and 8,766,405 are intra-DOAJ.
-* Match method: 118,314,316 edges were found by id and 6,446,081 (5.17%)
- via fuzzy matching.
-
-The IA Scholar citation graph is documented in various places:
-
-* https://blog.archive.org/2021/10/19/internet-archive-releases-refcat-the-ia-scholar-index-of-over-1-3-billion-scholarly-citations/
-* https://guide.fatcat.wiki/reference_graph.html
-* https://arxiv.org/abs/2110.06595
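
The basic numbers in the deleted note can be cross-checked against each other. The sketch below is not part of the original file; it only copies the figures from the note and assumes that "intra-DOAJ" means edges whose source and target are both in DOAJ, so the total should follow from inclusion-exclusion. The variable names are illustrative.

```python
# Figures copied from the "Basic numbers" list in notes/doaj_graph.md.
total_edges = 124_760_397
source_in_doaj = 98_616_033
target_in_doaj = 34_910_769
intra_doaj = 8_766_405       # assumed: source AND target both in DOAJ
matched_by_id = 118_314_316
matched_fuzzy = 6_446_081

# Inclusion-exclusion: edges with source or target in DOAJ.
union = source_in_doaj + target_in_doaj - intra_doaj

# Share of edges found via fuzzy matching, in percent.
fuzzy_share = round(100 * matched_fuzzy / total_edges, 2)

print(union == total_edges)
print(matched_by_id + matched_fuzzy == total_edges)
print(fuzzy_share)
```

Under that assumption, both sums reproduce the reported edge total and the fuzzy share rounds to the 5.17% stated in the note.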