From c4a96a00ffb5bcdee35ee36967e4da836ca58ed1 Mon Sep 17 00:00:00 2001 From: Martin Czygan Date: Sat, 15 May 2021 00:31:00 +0200 Subject: notes on urls --- python/notes/version_3.md | 6 ++++++ 1 file changed, 6 insertions(+) diff --git a/python/notes/version_3.md b/python/notes/version_3.md index 66840bf..c2cdc60 100644 --- a/python/notes/version_3.md +++ b/python/notes/version_3.md @@ -302,3 +302,9 @@ So maybe 500k isbn in total? * need to find them, then validate them +---- + +## Notes on URLList + +* about 25M urls +* about 11075871 seem to have a "doi" -- cgit v1.2.3