aboutsummaryrefslogtreecommitdiffstats
path: root/pig/README.md
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2018-08-24 12:05:39 -0700
committerBryan Newbold <bnewbold@archive.org>2018-08-24 12:05:41 -0700
commit92584ec4201ecc27af423cbff7b4bc1573edf175 (patch)
tree416d1fae8bd82af1f470a1be25c8763da042a5e4 /pig/README.md
parente81774a66980ba17c42380884f39aa61b54e5eef (diff)
downloadsandcrawler-92584ec4201ecc27af423cbff7b4bc1573edf175.tar.gz
sandcrawler-92584ec4201ecc27af423cbff7b4bc1573edf175.zip
rework fetch_hadoop script
Should work on macOS now, and fetches hadoop in addition to pig. Still requires wget (not installed by default on macOS).
Diffstat (limited to 'pig/README.md')
-rw-r--r--pig/README.md9
1 files changed, 5 insertions, 4 deletions
diff --git a/pig/README.md b/pig/README.md
index d14d2ae..df8ce68 100644
--- a/pig/README.md
+++ b/pig/README.md
@@ -12,12 +12,13 @@ by `fetch_deps.sh`) due to [dependency/jar issues][pig-bug] in local mode of
To run tests, you need Java installed and `JAVA_HOME` configured.
-Fetch dependencies (pig):
+Fetch dependencies (including pig) from top-level directory:
- ./fetch_deps.sh
+ ./fetch_hadoop.sh
-Write .pig scripts here, and add a pytho wrapper test to `./tests/` when done.
-Test vector files (input/output) can go in `./tests/files/`.
+Write `.pig` scripts in this directory, and add a python wrapper test to
+`./tests/` when done. Test vector files (input/output) can go in
+`./tests/files/`.
Run the tests with: