# Extra code and docs
Rough notes on applications.
## Reproducibility
Projects often end up with a data folder that is hard to reproduce.
![](static/datadir.png)
What you need to know:
* commands invoked
* their order and dependencies
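A minimal sketch of recording both pieces of information, assuming Python 3.9+ for the standard-library `graphlib` module. The step names, script names, and file paths are hypothetical placeholders, not files from this project.

```python
from graphlib import TopologicalSorter

# Hypothetical pipeline: step name -> (command line, steps it depends on).
STEPS = {
    "fetch": ("python fetch.py --out data/raw.json", []),
    "clean": ("python clean.py data/raw.json data/clean.json", ["fetch"]),
    "cluster": ("python cluster.py data/clean.json data/clusters.json", ["clean"]),
}


def execution_order(steps):
    """Return step names so that every step comes after its dependencies."""
    graph = {name: set(deps) for name, (_, deps) in steps.items()}
    return list(TopologicalSorter(graph).static_order())


# Print the commands in an order that respects the dependencies.
for name in execution_order(STEPS):
    command, _ = STEPS[name]
    print(f"{name}: {command}")
```

Keeping the commands and their dependencies in one place like this is roughly what workflow tools do for you; the sketch only shows the idea.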
We want to run N clustering algorithms and, for each of them, possibly M
verification algorithms, so that in the end we can evaluate the approaches
against well-specified examples (test cases).
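The N × M evaluation could be sketched as follows. All function names, the toy algorithms, and the test case are assumptions made for illustration; real clustering and verification code would replace them.

```python
from itertools import product


# Hypothetical clustering algorithms: map a list of items to a list of clusters.
def cluster_by_first_letter(items):
    groups = {}
    for item in items:
        groups.setdefault(item[0], []).append(item)
    return list(groups.values())


def cluster_singletons(items):
    return [[item] for item in items]


def normalize(clusters):
    """Make clusterings comparable regardless of ordering."""
    return sorted(sorted(cluster) for cluster in clusters)


# Hypothetical verification algorithms: compare a result to the expected clusters.
def verify_exact(clusters, expected):
    return normalize(clusters) == normalize(expected)


def verify_count(clusters, expected):
    return len(clusters) == len(expected)


def evaluate(clusterers, verifiers, test_cases):
    """Run every clusterer on every test case, then every verifier on the result."""
    results = {}
    for (cname, cluster), (tname, (items, expected)) in product(
        clusterers.items(), test_cases.items()
    ):
        clusters = cluster(items)  # computed once, shared by all verifiers
        for vname, verify in verifiers.items():
            results[(cname, vname, tname)] = verify(clusters, expected)
    return results


clusterers = {"first_letter": cluster_by_first_letter, "singletons": cluster_singletons}
verifiers = {"exact": verify_exact, "count": verify_count}
test_cases = {
    "fruit": (["apple", "avocado", "banana"], [["apple", "avocado"], ["banana"]]),
}

results = evaluate(clusterers, verifiers, test_cases)
for key, passed in sorted(results.items()):
    print(key, passed)
```

The result is a table keyed by (clustering algorithm, verification algorithm, test case), which makes it straightforward to compare approaches side by side.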
# Reading
* [FAIR Computational Workflows](https://www.mitpressjournals.org/doi/pdf/10.1162/dint_a_00033)
> Computational workflows describe the complex multi-step methods that are used
> for data collection, data preparation, analytics, predictive modelling,
> and simulation that lead to new data products.
* [Sustainable data analysis](https://zenodo.org/record/4240139)