author | Bryan Newbold <bnewbold@robocracy.org> | 2020-08-11 15:45:36 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@robocracy.org> | 2020-08-11 15:45:39 -0700 |
commit | 03d2004717d36962aef1bd373d59ce799d7db9ab (patch) | |
tree | 95e1863476f0e6c4fa0c9b3232e34d024cba0f85 /notes/performance/kafka_pipeline.txt | |
parent | a95b382a7add348c15bca4ed98729e47b17df11a (diff) | |
entity update: change big5 ingest behavior
In addition to changing the OA default, this was the main intended
behavior change in this group of commits: we want to make fewer ingest
attempts that we *expect* to fail, but default to an ingest/crawl attempt
when we are uncertain. This is because there is a long tail of journals
that register DOIs and are de facto OA (fulltext is available), but we
don't have metadata indicating them as such.
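The decision rule described above can be sketched roughly as follows. This is a minimal illustration, not the code changed by this commit: the function name `should_attempt_ingest`, its parameters, and the treatment of "big5" as a publisher-type label are all assumptions made for the example.

```python
from typing import Optional

# Hypothetical label for the largest commercial publishers. Only the word
# "big5" comes from the commit subject; how it is stored is assumed here.
BIG5 = "big5"


def should_attempt_ingest(publisher_type: Optional[str], is_oa: Optional[bool]) -> bool:
    """Return True when an ingest/crawl attempt seems worth making.

    Hypothetical helper: skips attempts that are expected to fail and
    defaults to attempting when the metadata is inconclusive.
    """
    # Metadata positively marks the release as OA: always attempt.
    if is_oa is True:
        return True
    # Big publisher with no OA indication: the attempt is expected to
    # fail, so skip it (assumed interpretation of "big5 ingest behavior").
    if publisher_type == BIG5:
        return False
    # Everything else is uncertain. Long-tail journals may be de facto OA
    # even without metadata saying so, so default to attempting.
    return True


if __name__ == "__main__":
    print(should_attempt_ingest("big5", False))     # skipped
    print(should_attempt_ingest("longtail", None))  # attempted
```

Putting the permissive branch last means any publisher type not covered by an explicit rule falls through to an attempt, which matches the stated preference for crawling when uncertain.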
Diffstat (limited to 'notes/performance/kafka_pipeline.txt')
0 files changed, 0 insertions, 0 deletions