From f39e851e1a08e0dc03a08b095d267e2bdf85125a Mon Sep 17 00:00:00 2001 From: Martin Czygan Date: Tue, 3 Nov 2020 18:04:58 +0100 Subject: update notes --- extra/README.md | 26 +++++++++++++++++++++++++- extra/static/datadir.png | Bin 0 -> 72700 bytes 2 files changed, 25 insertions(+), 1 deletion(-) create mode 100644 extra/static/datadir.png diff --git a/extra/README.md b/extra/README.md index 5267592..d4ee8a3 100644 --- a/extra/README.md +++ b/extra/README.md @@ -1,3 +1,27 @@ # Extra code and docs -Notes on applications. +Rough notes on applications. + +## Reproducibility + +Projects end up with a data folder, that is hard to reproduce. + +![](static/datadir.png) + +What you need to know: + +* commands invoked +* their order and dependencies + +We want to run N cluster algorithms and maybe for each of them run M +verification algorithms, so in the end, we can evaluate approaches against well +specified examples (test cases). + + +# Reading + +* [FAIR Computational Workflows](https://www.mitpressjournals.org/doi/pdf/10.1162/dint_a_00033) + +> Computational workflows describe the complex multi-step methods that are used +> for data collection, data preparation, analytics, predictive modelling, +> and simulation that lead to new data products. diff --git a/extra/static/datadir.png b/extra/static/datadir.png new file mode 100644 index 0000000..a0e7230 Binary files /dev/null and b/extra/static/datadir.png differ -- cgit v1.2.3