author Bryan Newbold <bnewbold@robocracy.org> 2021-11-24 15:48:42 -0800
committer Bryan Newbold <bnewbold@robocracy.org> 2021-11-24 15:48:42 -0800
commit fafc32e0ea1adc95eea817af7273d4c47422b364 (patch)
tree 374281a68a1cb28dd6e964449887799c6c3cb3de /notes/bulk_edits
parent d6b1d3de6224b590a82b175f78b761df1a6df4a2 (diff)
codespell fixes to notes
Diffstat (limited to 'notes/bulk_edits')
-rw-r--r-- notes/bulk_edits/2019-10-08_file_cleanups.md | 2
-rw-r--r-- notes/bulk_edits/2020-03-19_arxiv_pubmed.md | 2
-rw-r--r-- notes/bulk_edits/2020-09-02_file_meta.md | 2
-rw-r--r-- notes/bulk_edits/2020-12-23_dblp.md | 2
-rw-r--r-- notes/bulk_edits/2020_datacite.md | 2
5 files changed, 5 insertions, 5 deletions
diff --git a/notes/bulk_edits/2019-10-08_file_cleanups.md b/notes/bulk_edits/2019-10-08_file_cleanups.md
index b61b37f0..2eebb363 100644
--- a/notes/bulk_edits/2019-10-08_file_cleanups.md
+++ b/notes/bulk_edits/2019-10-08_file_cleanups.md
@@ -5,7 +5,7 @@ web.archive.org). These URLs were created accidentally during fatcat
boostrapping; there are about 300k such file enties to fix.
Will also update archive.org link reltype to 'archive' (instead of
-'repository'), which is the new prefered style.
+'repository'), which is the new preferred style.
Generated the set of files to update like:
diff --git a/notes/bulk_edits/2020-03-19_arxiv_pubmed.md b/notes/bulk_edits/2020-03-19_arxiv_pubmed.md
index b2fd29d5..56e88880 100644
--- a/notes/bulk_edits/2020-03-19_arxiv_pubmed.md
+++ b/notes/bulk_edits/2020-03-19_arxiv_pubmed.md
@@ -1,7 +1,7 @@
On 2020-03-20, automated daily harvesting and importing of arxiv and pubmed
metadata started. In the case of pubmed, updates are enabled, so that recently
-created DOI releases get updated with PMID and extra metdata.
+created DOI releases get updated with PMID and extra metadata.
We also want to do last backfills of metadata since the last import up through
the first day updated by the continuous harvester.
diff --git a/notes/bulk_edits/2020-09-02_file_meta.md b/notes/bulk_edits/2020-09-02_file_meta.md
index 35c4d87f..b0606f2d 100644
--- a/notes/bulk_edits/2020-09-02_file_meta.md
+++ b/notes/bulk_edits/2020-09-02_file_meta.md
@@ -25,7 +25,7 @@ Partial wayback URL timestamps, for cases where we have the full timestamped URL
https://qa.fatcat.wiki/file/k73il3k5hzemtnkqa5qyorg6ci
https://qa.fatcat.wiki/file/7hstlrabfjb6vgyph7ntqtpkne
-Live-web URLs identical except for http/https flip or other trival things (much less frequent case):
+Live-web URLs identical except for http/https flip or other trivial things (much less frequent case):
http://eo1.gsfc.nasa.gov/new/validationReport/Technology/JoeCD/asner_etal_PNAS_20041.pdf
https://eo1.gsfc.nasa.gov/new/validationReport/Technology/JoeCD/asner_etal_PNAS_20041.pdf
diff --git a/notes/bulk_edits/2020-12-23_dblp.md b/notes/bulk_edits/2020-12-23_dblp.md
index c3ad0587..a33411cb 100644
--- a/notes/bulk_edits/2020-12-23_dblp.md
+++ b/notes/bulk_edits/2020-12-23_dblp.md
@@ -52,4 +52,4 @@ Run import:
=> Counter({'total': 7953365, 'has-doi': 4277307, 'skip': 3097418, 'skip-key-type': 2640968, 'skip-update': 2480449, 'exists': 943800, 'update': 889700, 'insert': 338842, 'skip-arxiv-corr': 312872, 'exists-fuzzy': 203103, 'skip-dblp-container-missing': 143578, 'skip-arxiv': 53, 'skip-title': 1})
Starting database size (roughly): Size: 684.08G
-Ending databse size: Size: 690.22G
+Ending database size: Size: 690.22G
diff --git a/notes/bulk_edits/2020_datacite.md b/notes/bulk_edits/2020_datacite.md
index 005841ae..05d09517 100644
--- a/notes/bulk_edits/2020_datacite.md
+++ b/notes/bulk_edits/2020_datacite.md
@@ -54,7 +54,7 @@ Compare with `--lang-detect`:
user 3m5.620s
sys 0m13.344s
-Not noticable?
+Not noticeable?
Whole run: