diff options
author | Bryan Newbold <bnewbold@archive.org> | 2022-04-26 15:27:54 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2022-04-26 15:27:54 -0700 |
commit | c0db231f1eebcf3acd78f0bf759e3df84e1d3b79 (patch) | |
tree | 03643592a1a21dbc7d66ba97f2e6a8892de4f196 /notes/tasks | |
parent | 6c4b358ce241709c476680492ce65600301f9abb (diff) | |
download | sandcrawler-c0db231f1eebcf3acd78f0bf759e3df84e1d3b79.tar.gz sandcrawler-c0db231f1eebcf3acd78f0bf759e3df84e1d3b79.zip |
.ua crawling follow-up stats
Diffstat (limited to 'notes/tasks')
-rw-r--r-- | notes/tasks/2022-03-07_ukraine_firedrill.md | 4 |
1 files changed, 2 insertions, 2 deletions
diff --git a/notes/tasks/2022-03-07_ukraine_firedrill.md b/notes/tasks/2022-03-07_ukraine_firedrill.md index 222f9b7..c727a57 100644 --- a/notes/tasks/2022-03-07_ukraine_firedrill.md +++ b/notes/tasks/2022-03-07_ukraine_firedrill.md @@ -6,8 +6,8 @@ Want to do priority crawling of Ukranian web content, plus Russia and Belarus. (country_code:ua OR lang:uk) => 2022-03-08, before ingests: 470,986 total, 170,987 missing, almost all article-journal, peak in 2019, 55k explicitly OA - => later in day, already some 22k missing found! wow - + later in day, already some 22k missing found! wow + => 2022-04-04, after ingests: 476,174 total, 131,063 missing, 49k OA missing ## Metadata Prep |