author Bryan Newbold <bnewbold@archive.org> 2022-04-26 15:27:54 -0700
committer Bryan Newbold <bnewbold@archive.org> 2022-04-26 15:27:54 -0700
commit c0db231f1eebcf3acd78f0bf759e3df84e1d3b79
tree 03643592a1a21dbc7d66ba97f2e6a8892de4f196 /notes
parent 6c4b358ce241709c476680492ce65600301f9abb
.ua crawling follow-up stats
Diffstat (limited to 'notes')
-rw-r--r-- notes/tasks/2022-03-07_ukraine_firedrill.md | 4
1 files changed, 2 insertions, 2 deletions
diff --git a/notes/tasks/2022-03-07_ukraine_firedrill.md b/notes/tasks/2022-03-07_ukraine_firedrill.md
index 222f9b7..c727a57 100644
--- a/notes/tasks/2022-03-07_ukraine_firedrill.md
+++ b/notes/tasks/2022-03-07_ukraine_firedrill.md
@@ -6,8 +6,8 @@ Want to do priority crawling of Ukranian web content, plus Russia and Belarus.
(country_code:ua OR lang:uk)
=> 2022-03-08, before ingests: 470,986 total, 170,987 missing, almost all article-journal, peak in 2019, 55k explicitly OA
- => later in day, already some 22k missing found! wow
-
+ later in day, already some 22k missing found! wow
+ => 2022-04-04, after ingests: 476,174 total, 131,063 missing, 49k OA missing
## Metadata Prep