author Bryan Newbold <bnewbold@robocracy.org> 2019-04-30 17:16:38 -0700
committer Bryan Newbold <bnewbold@robocracy.org> 2019-04-30 17:16:38 -0700
commit e12f584a658658d8393753a89b88186e8322e59c
tree 20fefec2d362dde3354e26bf09de0af5daef0487
parent 2340854121b83f190094cd45f2a2da9e93ec5162
ideas for some bots to build
-rw-r--r-- notes/bot_ideas.txt 26
1 files changed, 26 insertions, 0 deletions
diff --git a/notes/bot_ideas.txt b/notes/bot_ideas.txt
new file mode 100644
index 00000000..2d978279
--- /dev/null
+++ b/notes/bot_ideas.txt
@@ -0,0 +1,26 @@
+
+check for known bad files (zero bytes, etc) by hash
+
+## extid / ExtidReviewBot
+
+Deeper check of external identifiers.
+
+Checks:
+- duplication of entities: that no entities with the same identifiers already exist
+- DOI numbers should be registered (check doi.org)
+
+## style
+
+Check release entity style.
+
+- release_type, status, etc are set
+- title doesn't end in a period
+- at least one contributor
+
+## URLs
+
+Look at URLs and see if they look valid.
+
+Checks:
+- for "single page" springer domain (and others)
+- no 'None' web.archive.org links (old bug)
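
A few of the checks listed in the notes can be done offline, and the following is a minimal Python sketch of how they might look. It is not code from this commit. The function names, the plain-dict release layout, and the field keys (`release_type`, `status`, `title`, `contribs`) are assumptions made for illustration, as is the guess that the old web.archive.org bug shows up as a literal `None` path segment. The duplicate-entity and DOI registration checks need a database or network lookup and are not covered.

```python
import hashlib
from urllib.parse import urlsplit

# Known-bad files keyed by SHA-1. Only the zero-byte file is listed here;
# its digest is computed rather than hard-coded.
KNOWN_BAD_SHA1 = {
    hashlib.sha1(b"").hexdigest(): "zero-byte file",
}


def check_file_hash(sha1):
    """Return a reason string if the SHA-1 is known bad, else None."""
    if not sha1:
        return None
    return KNOWN_BAD_SHA1.get(sha1.lower())


def check_release_style(release):
    """Return a list of style problems for a release given as a plain dict.

    The dict keys are hypothetical stand-ins for the real entity schema.
    """
    problems = []
    for field in ("release_type", "status"):
        if not release.get(field):
            problems.append(f"{field} is not set")
    title = (release.get("title") or "").strip()
    if title.endswith("."):
        problems.append("title ends in a period")
    if not release.get("contribs"):
        problems.append("no contributors")
    return problems


def check_url(url):
    """Return a list of problems found in a single URL string."""
    problems = []
    parts = urlsplit(url)
    # Assumption: the old bug left the string 'None' as a path segment,
    # for example where a capture timestamp would normally appear.
    if parts.hostname == "web.archive.org" and "None" in parts.path.split("/"):
        problems.append("web.archive.org link contains 'None'")
    return problems
```

Each function returns either a reason or a list of problems, so a bot could collect the results into a single review annotation instead of stopping at the first failure.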