aboutsummaryrefslogtreecommitdiffstats
path: root/minio
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2019-09-20 20:04:53 -0700
committerBryan Newbold <bnewbold@archive.org>2019-09-20 20:04:53 -0700
commita139bcc56911e83ecba55ab6474d6aa867d9d02f (patch)
tree80ba87d11d7bc42e35c8fac8b5ce997d7b211acc /minio
parentf89906a442977a48f99bbb8b52ba7c60ec366c89 (diff)
downloadsandcrawler-a139bcc56911e83ecba55ab6474d6aa867d9d02f.tar.gz
sandcrawler-a139bcc56911e83ecba55ab6474d6aa867d9d02f.zip
update service docs
Diffstat (limited to 'minio')
-rw-r--r--minio/README.md7
1 files changed, 7 insertions, 0 deletions
diff --git a/minio/README.md b/minio/README.md
index 8e8e29f..3ce0f95 100644
--- a/minio/README.md
+++ b/minio/README.md
@@ -11,6 +11,10 @@ Notable buckets, and structure/naming convention:
grobid/
2c/0d/2c0daa9307887a27054d4d1f137514b0fa6c6b2d.tei.xml
SHA1 (lower-case hex) of PDF that XML was extracted from
+ unpaywall/grobid/
+ 2c/0d/2c0daa9307887a27054d4d1f137514b0fa6c6b2d.tei.xml
+ SHA1 (lower-case hex) of PDF that XML was extracted from
+ (mirror of /grobid/ for which we crawled for unpaywall and made publicly accessible)
Create new buckets like:
@@ -22,3 +26,6 @@ Create a new readonly user like:
mc admin user add sandcrawler unpaywall $RANDOM_SECRET_KEY readonly
+Make a prefix within a bucket world-readable like:
+
+ mc policy set download sandcrawler/unpaywall/grobid