author: Bryan Newbold <bnewbold@archive.org> 2018-05-08 06:44:39 +0000
committer: Bryan Newbold <bnewbold@archive.org> 2018-05-08 06:44:39 +0000
commit: 681b085bc2a090b8db366c54780f1ec81d811403 (patch)
tree: b5686571cd073b4798cd224aef5df746d12d784b /pig/README.md
parent: 2a1c887309305187d785b34a16c1868d26cb3273 (diff)
how to run pig in production
Diffstat (limited to 'pig/README.md')
-rw-r--r-- pig/README.md 5
1 files changed, 5 insertions, 0 deletions
diff --git a/pig/README.md b/pig/README.md
index 7b5806b..d14d2ae 100644
--- a/pig/README.md
+++ b/pig/README.md
@@ -28,3 +28,8 @@ to just download.
[local-pig]: https://hub.docker.com/r/chalimartines/local-pig
+## Run in Production
+
+    pig -param INPUT="/user/bnewbold/pdfs/global-20171227034923" \
+        -param OUTPUT="/user/bnewbold/pdfs/gwb-pdf-20171227034923-surt-filter" \
+        filter-cdx-paper-pdfs.pig