Introduction
24-Jun-2025
Good practices for working with data
How to use the version control system Git to track changes to code
How to use the package and environment manager Conda
How to use the workflow managers Snakemake and Nextflow
How to generate automated reports using Quarto and Jupyter
How to use Docker and Apptainer to distribute containerised computational environments
National Bioinformatics Infrastructure Sweden
In 2015 the Open Science Collaboration set out to replicate 100 experiments published in high-impact psychology journals. 1
The same year, money spent on preclinical research that could not be reproduced was estimated at $28 billion in the US. 1
In 2016, 1,576 scientists were surveyed about reproducibility. 1
In 2016, 1,576 scientists were surveyed about reproducibility. 1
In 2018, Stodden et al estimated the reproducibility rate of computational papers published in the journal Science. 1
More examples:
Innovation points out paths that are possible; replication points out paths that are likely; progress relies on both. 1
Data | |||
Same | Different | ||
Code | Same | Reproducible | Replicable |
Different | Robust | Generalisable |
Decent:
Good:
Great:
.csv
rather than .xls
.Decent:
Good:
Great:
Decent:
Good:
Great:
Before the project:
During the project:
After the project: