blob: e998728fd5f15ba9562f887f9cfbb967a5b00ed2 (
plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
|
Will probably eventually refactor into top-level plus modules. Eg, "common"
directory, "backfill" and "extraction" as sub-directories. Downside of this is
single giant pipenv venv with all dependencies?
- how to get argument (like --hbase-table) into mrjob.conf, or similar?
- fix pig gitlab-ci tests (JAVA_HOME). also make fetch_deps *way* more quiet
sentry:
- https://github.com/getsentry/raven-python
potential helpers:
- https://github.com/martinblech/xmltodict
- https://github.com/trananhkma/fucking-awesome-python#text-processing
- https://github.com/blaze/blaze (for catalog/analytics)
- validation: https://github.com/pyeve/cerberus
- testing (to replace nose):
- https://github.com/CleanCut/green
- pytest
- mamba ("behavior driven")
|