diff options
author | Bryan Newbold <bnewbold@robocracy.org> | 2020-01-29 21:57:32 -0800 |
---|---|---|
committer | Bryan Newbold <bnewbold@robocracy.org> | 2020-01-29 21:59:05 -0800 |
commit | d5d83762063b8ec7f512c20567f46c03f2e6b542 (patch) | |
tree | 8e836c4dedd169ee85512fbf9e273c64a4d8a4be | |
parent | d58c3891ac2122dac53ced606568108f543f2d80 (diff) | |
download | fatcat-d5d83762063b8ec7f512c20567f46c03f2e6b542.tar.gz fatcat-d5d83762063b8ec7f512c20567f46c03f2e6b542.zip |
update ES docs and proposal
-rw-r--r-- | extra/elasticsearch/README.md | 2 | ||||
-rw-r--r-- | proposals/2020_elasticsearch_schemas.md | 8 |
2 files changed, 6 insertions, 4 deletions
diff --git a/extra/elasticsearch/README.md b/extra/elasticsearch/README.md index 3a48a178..3e0857b4 100644 --- a/extra/elasticsearch/README.md +++ b/extra/elasticsearch/README.md @@ -40,9 +40,11 @@ Drop and rebuild the schema: http delete :9200/fatcat_release http delete :9200/fatcat_container + http delete :9200/fatcat_file http delete :9200/fatcat_changelog http put :9200/fatcat_release < release_schema.json http put :9200/fatcat_container < container_schema.json + http put :9200/fatcat_file < file_schema.json http put :9200/fatcat_changelog < changelog_schema.json Put a single object (good for debugging): diff --git a/proposals/2020_elasticsearch_schemas.md b/proposals/2020_elasticsearch_schemas.md index 83db884f..5fb28d19 100644 --- a/proposals/2020_elasticsearch_schemas.md +++ b/proposals/2020_elasticsearch_schemas.md @@ -14,8 +14,6 @@ Simple additions: - pages - `first_page` (parsed from pages) (?) - number -- `in_shadow` -- OA license slug (?) - `doi_prefix` - `doi_registrar` (based on extra) - `first_author` (surname; for matching) @@ -25,6 +23,8 @@ Simple additions: - referenced releases idents - contrib creator idents +Add affiliations, both as raw strings and ROR identifiers. + ## Preservation Summary Field @@ -128,8 +128,8 @@ hit does not}"). ## Container Fields -- `all_issns` -- `release_count` +- `issn` (all issns) +- `original_name` The `release_count` would not be indexed (left null) by default, and would be "patched" in to entities by a separate script (periodically?). |