Exercises


Here we provide short tutorials on the different steps of scRNAseq analysis using either of the 3 commonly used scRNAseq analysis pipelines, Seurat, Scran and Scanpy. It is up to you which one you want to try out, if you finish quickly, you may have time to run several of them or run of the additional labs below. In principle we perform the same steps with all 3 pipelines, but there are some small differences as all different methods are not implemented in all the pipelines.


MAIN exercises

All scripts (Rmarkdown or ipython notebooks) can be found at our github repo in folder labs/compiled

Please me sure you have completed the Precourse material as well as the Conda instructions

Below you will find the link to the .Rmd that you should use as well as the rendered exercise report (“answers”).

Tutorial Seurat Scater/Scran Scanpy
Quality Control Seurat_qc (.Rmd) Scater_qc (.Rmd) ScanPY_qc (.ipynb)
Dimensionality reduction Seurat_dr (.Rmd) Scater_dr (.Rmd) Scanpy_dr (.ipynb)
Data integration Seurat_integr (.Rmd) Scater_integr (.Rmd) Scanpy_integr (.ipynb)
Clustering Seurat_clust (.Rmd) Scater_clust (.Rmd) Scanpy_clust (.ipynb)
Differential expression Seurat_dge (.Rmd) Scater_dge (.Rmd) Scanpy_dge (.ipynb)
Trajectory inference Slingshot_ti (.Rmd) Slingshot_ti PAGA_ti


Environments being used in the course (see Conda instructions ):

The easiest way of getting started with the exercises is to download the .Rmd/.ipynb file and then open it with Rstudio / Jypyter Notebooks. First activate your conda environment, then copy the link of a .Rmd/.ipynb file and then type:

wget <LINK_TO_Exercise1.Rmd_FILE>
rstudio Exercise1.Rmd &

Or in python for .ipynb:

wget <LINK_TO_Exercise1.ipynb_FILE>
jupyter notebook Exercise1.ipynb &

We highly recommend you to use the files provided instead of copying and pasting from the rendered report. Keep in mind that the results may vary slightly depending on the parameters used.



FAQ

As you run into problems, we will try to fill in the FAQ with common questions.



BONUS exercises

Please note that the exercises listed below belong to past courses and might not be completely updated. Nonetheless, they provide even more details and options to analyse single cell data.

Name (link) Description
Read-Pipeline Snakemake pipeline for processing SmartSeq2 data, mapping reads, QC and expression estimates
biomaRt For those not familiar with working with biomaRt, we suggest that you have a look at this example code for how to convert between different formats using biomaRt
PCA, tSNE and clustering Basic PCA, tSNE and clustering using base R on mouse embryonic development data.
KNN-graphs Construction of graphs from cell-cell similiarities using igraph
Estimating Batch-Effects A tutorial for estimating genome-wide and individual genes batch-effects
Normalization Comparison A tutorial for comparison scRNAseq and bulk RNAseq normalisation strategies.
SC3 package Tutorial with the SC3 consensus clustering package
Trajectory with Monocle2 A tutorial with mouse embryonic data using the Monocle package for pseudotime analysis
Differential expression_2 OBS! This old tutorial uses Seurat v2! For this tutorial we have included several different methods for differential expression tests on single cell data, including SCDE, MAST, SC3 and Seurat. The exercise has been split into 2 parts with evaluation of all results in the second part
UPPMAX Sbatch One example of a sbatch script
Pagoda Pagoda pathway wPCA for clustering of cells. OBS! several steps in this tutorial takes hours to run if you work with your own dataset, a good suggestion is to start with the first steps, knn.error.model, pagoda.varnorm and pagoda.pathway.wPCA and let it run while working on other tutorials. You can also run it with more than one core to speed things up

We will try to keep these tutorials up to date. If you find any errors or things that you think should be updated please contact Åsa (asa.bjorklund@scilifelab.se) or Paulo (paulo.czarnewski@scilifelab.se)


Icons are provided from [www.svgrepo.com](www.svgrepo.com)