diff options
author | Bryan Newbold <bnewbold@robocracy.org> | 2019-04-30 17:16:38 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@robocracy.org> | 2019-04-30 17:16:38 -0700 |
commit | e12f584a658658d8393753a89b88186e8322e59c (patch) | |
tree | 20fefec2d362dde3354e26bf09de0af5daef0487 | |
parent | 2340854121b83f190094cd45f2a2da9e93ec5162 (diff) | |
download | fatcat-e12f584a658658d8393753a89b88186e8322e59c.tar.gz fatcat-e12f584a658658d8393753a89b88186e8322e59c.zip |
ideas for some bots to build
-rw-r--r-- | notes/bot_ideas.txt | 26 |
1 files changed, 26 insertions, 0 deletions
diff --git a/notes/bot_ideas.txt b/notes/bot_ideas.txt new file mode 100644 index 00000000..2d978279 --- /dev/null +++ b/notes/bot_ideas.txt @@ -0,0 +1,26 @@ + +check for known bad files (zero bytes, etc) by hash + +## extid / ExtidReviewBot + +Deeper check of external identifiers. + +Checks: +- duplication of entities: that no entities with the same identifiers already exist +- DOI numbers should be registered (check doi.org) + +## style + +Check release entity style. + +- release_type, status, etc are set +- title doesn't end in a period +- at least one contributor + +## URLs + +Look at URLs and see if they look valid. + +Checks: +- for "single page" springer domain (and others) +- no 'None' web.archive.org links (old bug) |