path: root/README.md
author Bryan Newbold <bnewbold@archive.org> 2018-04-03 02:25:57 +0000
committer Bryan Newbold <bnewbold@archive.org> 2018-04-03 02:25:57 +0000
commit 9d4520e8e18d7bf9b36d98d330417360194e80a3
tree bedee72dd46c0fc444476b7250cd59a51bbb4fb4
parent ca2d1bd7eccd619173e1d326c9a2ebb1a2d3d502
shift docs around a bit
Diffstat (limited to 'README.md')
-rw-r--r-- README.md 16
1 files changed, 12 insertions, 4 deletions
diff --git a/README.md b/README.md
index 1a251eb..8589705 100644
--- a/README.md
+++ b/README.md
@@ -1,9 +1,9 @@
_ _
- _________ ___ __ _ _ __ __| | ___ _ __ __ ___ _| | ___ _ __
- \ | / __|/ _` | '_ \ / _` |/ __| '__/ _` \ \ /\ / / |/ _ \ '__|
- \ | \__ \ (_| | | | | (_| | (__| | | (_| |\ V V /| | __/ |
- \@@@@@@| |___/\__,_|_| |_|\__,_|\___|_| \__,_| \_/\_/ |_|\___|_|
+ __________ ___ __ _ _ __ __| | ___ _ __ __ ___ _| | ___ _ __
+ \ | / __|/ _` | '_ \ / _` |/ __| '__/ _` \ \ /\ / / |/ _ \ '__|
+ \ | \__ \ (_| | | | | (_| | (__| | | (_| |\ V V /| | __/ |
+ \ooooooo| |___/\__,_|_| |_|\__,_|\___|_| \__,_| \_/\_/ |_|\___|_|
This repo contains hadoop tasks (mapreduce and pig), luigi jobs, and other
@@ -14,3 +14,11 @@ This repository is potentially public.
Archive-specific deployment/production guides and ansible scripts at:
[journal-infra](https://git.archive.org/bnewbold/journal-infra)
+## Python Setup
+
+Pretty much everything here uses python/pipenv. To set up your environment for
+this, and python in general:
+
+    # libjpeg-dev is for some wayback/pillow stuff
+    sudo apt install python3-dev python3-pip python3-wheel libjpeg-dev
+    pip3 install --user pipenv
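
Note on the setup steps above (not part of the patch): `pip3 install --user` places scripts in the per-user base's `bin` directory, which is not always on `PATH`. The following is a minimal sketch of a check, assuming a POSIX shell and a Linux-style user install layout. The variable name `user_bin` is illustrative and does not come from the repository.

```shell
# Assumption: pipenv was installed with `pip3 install --user pipenv`,
# so its script lives in the per-user base's bin directory.
user_bin="$(python3 -m site --user-base)/bin"

# Prepend that directory to PATH only if it is not already there.
case ":$PATH:" in
  *":$user_bin:"*) ;;
  *) PATH="$user_bin:$PATH"; export PATH ;;
esac

# Report whether the pipenv command can now be found.
if command -v pipenv >/dev/null 2>&1; then
  pipenv --version
else
  echo "pipenv not found on PATH; check the pip3 install step" >&2
fi
```

If the check reports that pipenv is missing, rerunning the `pip3 install --user pipenv` step and opening a new shell is a reasonable first thing to try.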