author | Bryan Newbold <bnewbold@robocracy.org> | 2020-06-04 14:01:34 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@robocracy.org> | 2020-06-04 14:12:30 -0700 |
commit | a42d5f0d00e76bf8474647fae4e1d9d61693a7d9 (patch) | |
tree | f2556c2e40212da192517d0abd7c4f9e47e82cbb /python/tests/import_grobid_metadata.py | |
parent | 71e5662365892d32a5f92e2733b7ae804c833f57 (diff) | |
ES schema: add best_url to file schema
This will increase index size (URLs are often long in our corpus, and we
have many file entities), but seems worth it.
Initially added `ia_url` as a second field, guaranteed to always be an
*.archive.org URL, but `best_url` defaults to an archive.org URL anyway,
so a separate field didn't seem worthwhile.
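
To illustrate why a separate `ia_url` field would be redundant, here is a minimal sketch of a selection rule that prefers an archive.org URL and falls back to the first URL otherwise. The function name `pick_best_url`, its plain list-of-strings input, and the fallback behavior are assumptions made for illustration; this is not the transform code from this commit.

```python
from urllib.parse import urlparse


def pick_best_url(urls):
    """Hypothetical helper: choose one URL to index as `best_url`.

    Prefers the first URL hosted on archive.org (or a subdomain of it);
    otherwise falls back to the first URL; returns None for an empty list.
    """
    for url in urls:
        host = urlparse(url).hostname or ""
        if host == "archive.org" or host.endswith(".archive.org"):
            return url
    return urls[0] if urls else None
```

Under a rule like this, any file that has an archive.org copy already carries that URL in `best_url`, so a second always-archive.org field would mostly duplicate it while growing the index further.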
Diffstat (limited to 'python/tests/import_grobid_metadata.py')
0 files changed, 0 insertions, 0 deletions