aboutsummaryrefslogtreecommitdiffstats
path: root/TODO
blob: e998728fd5f15ba9562f887f9cfbb967a5b00ed2 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20

Will probably eventually refactor into top-level plus modules. Eg, "common"
directory, "backfill" and "extraction" as sub-directories. Downside of this is
single giant pipenv venv with all dependencies?

- how to get argument (like --hbase-table) into mrjob.conf, or similar?
- fix pig gitlab-ci tests (JAVA_HOME). also make fetch_deps *way* more quiet

sentry:
- https://github.com/getsentry/raven-python

potential helpers:
- https://github.com/martinblech/xmltodict
- https://github.com/trananhkma/fucking-awesome-python#text-processing
- https://github.com/blaze/blaze (for catalog/analytics)
- validation: https://github.com/pyeve/cerberus
- testing (to replace nose):
    - https://github.com/CleanCut/green
    - pytest
    - mamba ("behavior driven")