From fafc32e0ea1adc95eea817af7273d4c47422b364 Mon Sep 17 00:00:00 2001
From: Bryan Newbold
Date: Wed, 24 Nov 2021 15:48:42 -0800
Subject: codepsell fixes to notes

---
 notes/bulk_edits/2019-10-08_file_cleanups.md | 2 +-
 notes/bulk_edits/2020-03-19_arxiv_pubmed.md  | 2 +-
 notes/bulk_edits/2020-09-02_file_meta.md     | 2 +-
 notes/bulk_edits/2020-12-23_dblp.md          | 2 +-
 notes/bulk_edits/2020_datacite.md            | 2 +-
 5 files changed, 5 insertions(+), 5 deletions(-)

diff --git a/notes/bulk_edits/2019-10-08_file_cleanups.md b/notes/bulk_edits/2019-10-08_file_cleanups.md
index b61b37f0..2eebb363 100644
--- a/notes/bulk_edits/2019-10-08_file_cleanups.md
+++ b/notes/bulk_edits/2019-10-08_file_cleanups.md
@@ -5,7 +5,7 @@ web.archive.org). These URLs were created accidentally during fatcat
 boostrapping; there are about 300k such file enties to fix.
 
 Will also update archive.org link reltype to 'archive' (instead of
-'repository'), which is the new prefered style.
+'repository'), which is the new preferred style.
 
 Generated the set of files to update like:
 
diff --git a/notes/bulk_edits/2020-03-19_arxiv_pubmed.md b/notes/bulk_edits/2020-03-19_arxiv_pubmed.md
index b2fd29d5..56e88880 100644
--- a/notes/bulk_edits/2020-03-19_arxiv_pubmed.md
+++ b/notes/bulk_edits/2020-03-19_arxiv_pubmed.md
@@ -1,7 +1,7 @@
 
 On 2020-03-20, automated daily harvesting and importing of arxiv and pubmed
 metadata started. In the case of pubmed, updates are enabled, so that recently
-created DOI releases get updated with PMID and extra metdata.
+created DOI releases get updated with PMID and extra metadata.
 
 We also want to do last backfills of metadata since the last import up through
 the first day updated by the continuous harvester.
diff --git a/notes/bulk_edits/2020-09-02_file_meta.md b/notes/bulk_edits/2020-09-02_file_meta.md
index 35c4d87f..b0606f2d 100644
--- a/notes/bulk_edits/2020-09-02_file_meta.md
+++ b/notes/bulk_edits/2020-09-02_file_meta.md
@@ -25,7 +25,7 @@ Partial wayback URL timestamps, for cases where we have the full timestamped URL
 https://qa.fatcat.wiki/file/k73il3k5hzemtnkqa5qyorg6ci
 https://qa.fatcat.wiki/file/7hstlrabfjb6vgyph7ntqtpkne
 
-Live-web URLs identical except for http/https flip or other trival things (much less frequent case):
+Live-web URLs identical except for http/https flip or other trivial things (much less frequent case):
 
 http://eo1.gsfc.nasa.gov/new/validationReport/Technology/JoeCD/asner_etal_PNAS_20041.pdf
 https://eo1.gsfc.nasa.gov/new/validationReport/Technology/JoeCD/asner_etal_PNAS_20041.pdf
diff --git a/notes/bulk_edits/2020-12-23_dblp.md b/notes/bulk_edits/2020-12-23_dblp.md
index c3ad0587..a33411cb 100644
--- a/notes/bulk_edits/2020-12-23_dblp.md
+++ b/notes/bulk_edits/2020-12-23_dblp.md
@@ -52,4 +52,4 @@ Run import:
 => Counter({'total': 7953365, 'has-doi': 4277307, 'skip': 3097418, 'skip-key-type': 2640968, 'skip-update': 2480449, 'exists': 943800, 'update': 889700, 'insert': 338842, 'skip-arxiv-corr': 312872, 'exists-fuzzy': 203103, 'skip-dblp-container-missing': 143578, 'skip-arxiv': 53, 'skip-title': 1})
 
 Starting database size (roughly): Size: 684.08G
-Ending databse size: Size: 690.22G
+Ending database size: Size: 690.22G
diff --git a/notes/bulk_edits/2020_datacite.md b/notes/bulk_edits/2020_datacite.md
index 005841ae..05d09517 100644
--- a/notes/bulk_edits/2020_datacite.md
+++ b/notes/bulk_edits/2020_datacite.md
@@ -54,7 +54,7 @@ Compare with `--lang-detect`:
 user 3m5.620s
 sys 0m13.344s
 
-Not noticable?
+Not noticeable?
 
 Whole run:
 