diff options
author | Bryan Newbold <bnewbold@archive.org> | 2022-05-03 17:14:08 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2022-05-03 17:15:18 -0700 |
commit | ac7c44d332fcba83faae6a3e732c3415f6ab78a6 (patch) | |
tree | 3cf258f75b301d93e01552b59d439cd10e2c8a13 /notes/tasks/2021-09-09_pdf_url_lists.md | |
parent | 6dd9bc8d3312107796344341e43044907677bf85 (diff) | |
download | sandcrawler-ac7c44d332fcba83faae6a3e732c3415f6ab78a6.tar.gz sandcrawler-ac7c44d332fcba83faae6a3e732c3415f6ab78a6.zip |
PDF URL lists update
Diffstat (limited to 'notes/tasks/2021-09-09_pdf_url_lists.md')
-rw-r--r-- | notes/tasks/2021-09-09_pdf_url_lists.md | 4 |
1 files changed, 4 insertions, 0 deletions
diff --git a/notes/tasks/2021-09-09_pdf_url_lists.md b/notes/tasks/2021-09-09_pdf_url_lists.md index 52a3264..cd8176e 100644 --- a/notes/tasks/2021-09-09_pdf_url_lists.md +++ b/notes/tasks/2021-09-09_pdf_url_lists.md @@ -64,3 +64,7 @@ ingest_file_result table, pdf, success: 66,487,928 "Parsed web PDFs": `file_meta`, left join CDX (didn't do this one) + +--- + +Uploaded all these to <https://archive.org/download/ia_scholarly_urls_2021-09-09> |