aboutsummaryrefslogtreecommitdiffstats
path: root/backfill/README.md
diff options
context:
space:
mode:
authorBryan Newbold <bnewbold@archive.org>2018-03-30 18:29:29 -0700
committerBryan Newbold <bnewbold@archive.org>2018-03-30 18:29:29 -0700
commitf6915b4b44e312cee7eda9626d0330268ab786e2 (patch)
treed4c3436d1bf010a269ab612d23e39e8ae2e9a52d /backfill/README.md
parent1b6d04fffb6554fd8ef1479b29c7154df07e4d21 (diff)
downloadsandcrawler-f6915b4b44e312cee7eda9626d0330268ab786e2.tar.gz
sandcrawler-f6915b4b44e312cee7eda9626d0330268ab786e2.zip
refactor backfill for mrjob
Diffstat (limited to 'backfill/README.md')
-rw-r--r--backfill/README.md8
1 files changed, 8 insertions, 0 deletions
diff --git a/backfill/README.md b/backfill/README.md
new file mode 100644
index 0000000..90b4ba7
--- /dev/null
+++ b/backfill/README.md
@@ -0,0 +1,8 @@
+
+Run tests:
+
+ pipenv run python -m pytest
+
+Run locally on a file:
+
+ ./backfill_hbase_from_cdx.py tests/files/example.cdx