aboutsummaryrefslogtreecommitdiffstats
path: root/extra/cdx/README.md
blob: 609bfd943fd4e7b464d690ea164e7c8d4116b7a7 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
# Sample CDX Links

Given a sample from outbound web links from publications, determine number of
URLs we may have. We currently find about 44368911 URLs in the refs.

Limit to 10000 links.

10k random:

```
   7010 OK
   2940 MISS
```

10k w/o doi.org:

```
   6442 OK
   3480 MISS
```