aboutsummaryrefslogtreecommitdiffstats
path: root/minio
diff options
context:
space:
mode:
Diffstat (limited to 'minio')
-rw-r--r--minio/README.md7
1 files changed, 7 insertions, 0 deletions
diff --git a/minio/README.md b/minio/README.md
index 8e8e29f..3ce0f95 100644
--- a/minio/README.md
+++ b/minio/README.md
@@ -11,6 +11,10 @@ Notable buckets, and structure/naming convention:
grobid/
2c/0d/2c0daa9307887a27054d4d1f137514b0fa6c6b2d.tei.xml
SHA1 (lower-case hex) of PDF that XML was extracted from
+ unpaywall/grobid/
+ 2c/0d/2c0daa9307887a27054d4d1f137514b0fa6c6b2d.tei.xml
+ SHA1 (lower-case hex) of PDF that XML was extracted from
+ (mirror of /grobid/ for which we crawled for unpaywall and made publicly accessible)
Create new buckets like:
@@ -22,3 +26,6 @@ Create a new readonly user like:
mc admin user add sandcrawler unpaywall $RANDOM_SECRET_KEY readonly
+Make a prefix within a bucket world-readable like:
+
+ mc policy set download sandcrawler/unpaywall/grobid