author | Bryan Newbold <bnewbold@robocracy.org> | 2020-03-28 19:57:35 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@robocracy.org> | 2020-03-28 19:57:41 -0700 |
commit | 4b75a81cbd0faeefa6a0f04b97ecc6832924ee69 (patch) | |
tree | e729b83e33cfb686448a24129d010a088e1289dc /notes/url_structure.txt | |
parent | 2a3148df02962b84d0409f9f9900324d04404065 (diff) | |
ingest: more DOI patterns to treat as OA
These are journal/publisher DOI patterns that we suspect are actually
open access (OA), based on the large number of papers that crawl
successfully. The better long-term solution will be to flag containers
in some way as OA (or "should crawl"), but this is a good short-term fix.
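
As a rough illustration of the approach described in the commit message, a DOI pattern check might look like the sketch below. This is not the code from this commit: the pattern list, the name ASSUMED_OA_DOI_PATTERNS, and the function looks_like_oa_doi are all hypothetical, and the DOI prefixes are placeholders rather than real publishers.

```python
import re

# Placeholder patterns for illustration only; the actual patterns added by
# this commit are not shown on this page.
ASSUMED_OA_DOI_PATTERNS = [
    re.compile(r"^10\.12345/"),           # placeholder registrant prefix
    re.compile(r"^10\.54321/example\."),  # placeholder prefix plus a journal-specific suffix
]


def looks_like_oa_doi(doi: str) -> bool:
    """Return True if the DOI matches any assumed 'treat as OA' pattern.

    DOIs are case-insensitive, so the input is normalized to lowercase
    (and stripped of surrounding whitespace) before matching.
    """
    normalized = doi.strip().lower()
    return any(p.match(normalized) for p in ASSUMED_OA_DOI_PATTERNS)
```

A pattern list like this is easy to extend, but it encodes a guess per publisher. A per-container "should crawl" flag, as the commit message suggests, would move that decision into the catalog data instead.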
Diffstat (limited to 'notes/url_structure.txt')
0 files changed, 0 insertions, 0 deletions