aboutsummaryrefslogtreecommitdiffstats
path: root/scald-mvp/README.md
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2018-05-21 10:55:22 -0700
committerBryan Newbold <bnewbold@archive.org>2018-05-21 10:55:22 -0700
commit5d5b828730fdf34dcd2a6aeba64c7df2c1be23c5 (patch)
tree5f7322b2dc25317ab847d98e182ce801a1a5eae6 /scald-mvp/README.md
parentb18c68c81b4c426b5d83f2e6c31026b9febcb6e0 (diff)
downloadsandcrawler-5d5b828730fdf34dcd2a6aeba64c7df2c1be23c5.tar.gz
sandcrawler-5d5b828730fdf34dcd2a6aeba64c7df2c1be23c5.zip
copy in scalding learning example
Diffstat (limited to 'scald-mvp/README.md')
-rw-r--r--scald-mvp/README.md30
1 files changed, 30 insertions, 0 deletions
diff --git a/scald-mvp/README.md b/scald-mvp/README.md
new file mode 100644
index 0000000..10cac0f
--- /dev/null
+++ b/scald-mvp/README.md
@@ -0,0 +1,30 @@
+
+following https://medium.com/@gayani.nan/how-to-run-a-scalding-job-567160fa193
+
+
+running on my laptop:
+
+ openjdk version "1.8.0_171"
+ OpenJDK Runtime Environment (build 1.8.0_171-8u171-b11-1~deb9u1-b11)
+ OpenJDK 64-Bit Server VM (build 25.171-b11, mixed mode)
+
+ Scala code runner version 2.11.8 -- Copyright 2002-2016, LAMP/EPFL
+
+ sbt: 1.1.5
+
+ sbt new scala/scala-seed.g8
+
+ # inserted additional deps, tweaked versions
+ # hadoop 2.5.0 seems to conflict with cascading; sticking with 2.6.0
+
+ sbt assembly
+ scp target/scala-2.11/scald-mvp-assembly-0.1.0-SNAPSHOT.jar devbox:
+
+ # on cluster:
+ yarn jar scald-mvp-assembly-0.1.0-SNAPSHOT.jar WordCount --hdfs --input hdfs:///user/bnewbold/dummy.txt
+
+## ATTIC
+
+wrote build.sbt from scratch
+
+`sbt` command from `twitter/scalding` upstream repo