diff options
author | Bryan Newbold <bnewbold@archive.org> | 2018-05-08 06:44:39 +0000 |
---|---|---|
committer | Bryan Newbold <bnewbold@archive.org> | 2018-05-08 06:44:39 +0000 |
commit | 681b085bc2a090b8db366c54780f1ec81d811403 (patch) | |
tree | b5686571cd073b4798cd224aef5df746d12d784b | |
parent | 2a1c887309305187d785b34a16c1868d26cb3273 (diff) | |
download | sandcrawler-681b085bc2a090b8db366c54780f1ec81d811403.tar.gz sandcrawler-681b085bc2a090b8db366c54780f1ec81d811403.zip |
how to run pig in production
-rw-r--r-- | pig/README.md | 5 |
1 files changed, 5 insertions, 0 deletions
diff --git a/pig/README.md b/pig/README.md index 7b5806b..d14d2ae 100644 --- a/pig/README.md +++ b/pig/README.md @@ -28,3 +28,8 @@ to just download. [local-pig]: https://hub.docker.com/r/chalimartines/local-pig +## Run in Production + + pig -param INPUT="/user/bnewbold/pdfs/global-20171227034923" \ + -param OUTPUT="/user/bnewbold/pdfs/gwb-pdf-20171227034923-surt-filter" \ + filter-cdx-paper-pdfs.pig |