aboutsummaryrefslogtreecommitdiffstats
path: root/extra/README.md
blob: e32768e74430e6db79d912fdb7c21be6ba90989e (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
# Extra code and docs

Rough notes on applications.

## Reproducibility

Projects end up with a data folder, that is hard to reproduce.

![](static/datadir.png)

What you need to know:

* commands invoked
* their order and dependencies

We want to run N cluster algorithms and maybe for each of them run M
verification algorithms, so in the end, we can evaluate approaches against well
specified examples (test cases).


# Reading

* [FAIR Computational Workflows](https://www.mitpressjournals.org/doi/pdf/10.1162/dint_a_00033)

> Computational workflows describe the complex multi-step methods that are used
> for data collection, data preparation,  analytics,  predictive  modelling,
> and  simulation  that  lead  to  new  data  products.

* [Sustainable data analysis](https://zenodo.org/record/4240139)