diff options
author | Bryan Newbold <bnewbold@archive.org> | 2021-10-11 16:29:10 -0700 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2021-10-15 18:15:29 -0700 |
commit | 0666fa06fb48e6a856e63e9a06fa28e9a11761b3 (patch) | |
tree | a3efa004a49c33a668cd78bce3752b97aadbaf16 /proposals | |
parent | cc9c63d6b9d07c9d192c32e107254932f4b4a66b (diff) | |
download | sandcrawler-0666fa06fb48e6a856e63e9a06fa28e9a11761b3.tar.gz sandcrawler-0666fa06fb48e6a856e63e9a06fa28e9a11761b3.zip |
fileset ingest notes
Diffstat (limited to 'proposals')
-rw-r--r-- | proposals/2021-09-09_dataset_ingest.md | 26 |
1 files changed, 23 insertions, 3 deletions
diff --git a/proposals/2021-09-09_dataset_ingest.md b/proposals/2021-09-09_dataset_ingest.md index d4d2be4..cbfeb68 100644 --- a/proposals/2021-09-09_dataset_ingest.md +++ b/proposals/2021-09-09_dataset_ingest.md @@ -214,6 +214,26 @@ doi:10.7910/DVN/CLSFKX Mulitple files; multiple versions? -Single file inside: - -<https://dataverse.harvard.edu/file.xhtml?persistentId=doi:10.7910/DVN/CLSFKX/XWEHBB> +API fetch: <https://dataverse.harvard.edu/api/datasets/:persistentId/?persistentId=doi:10.7910/DVN/CLSFKX&version=1.1> + + .data.id + .data.latestVersion.datasetPersistentId + .data.latestVersion.versionNumber, .versionMinorNumber + .data.latestVersion.files[] + .dataFile + .contentType (mimetype) + .filename + .filesize (int, bytes) + .md5 + .persistendId + .description + .label (filename?) + .version + +Single file inside: <https://dataverse.harvard.edu/file.xhtml?persistentId=doi:10.7910/DVN/CLSFKX/XWEHBB> + +Download single file: <https://dataverse.harvard.edu/api/access/datafile/:persistentId/?persistentId=doi:10.7910/DVN/CLSFKX/XWEHBB> (redirects to AWS S3) + +Dataverse refs: +- 'doi' and 'hdl' are the two persistentId styles +- file-level persistentIds are optional, on a per-instance basis: https://guides.dataverse.org/en/latest/installation/config.html#filepidsenabled |