Here we provide short tutorials on the different steps of scRNAseq analysis using either of the 3 commonly used scRNAseq analysis pipelines, Seurat, Scran and Scanpy. It is up to you which one you want to try out, if you finish quickly, you may have time to run several of them or run of the additional labs below. In principle we perform the same steps with all 3 pipelines, but there are some small differences as all different methods are not implemented in all the pipelines.
All scripts (Rmarkdown or ipython notebooks) can be found at our github repo in folder labs/compiled
Please me sure you have completed the Precourse material as well as the Conda instructions
Below you will find the link to the .Rmd
that you should use as well as the rendered exercise report (“answers”).
Tutorial | Seurat | Scater/Scran | Scanpy |
---|---|---|---|
Quality Control | Seurat_qc (.Rmd) | Scater_qc (.Rmd) | ScanPY_qc (.ipynb) |
Dimensionality reduction | Seurat_dr (.Rmd) | Scater_dr (.Rmd) | Scanpy_dr (.ipynb) |
Data integration | Seurat_integr (.Rmd) | Scater_integr (.Rmd) | Scanpy_integr (.ipynb) |
Clustering | Seurat_clust (.Rmd) | Scater_clust (.Rmd) | Scanpy_clust (.ipynb) |
Differential expression | Seurat_dge (.Rmd) | Scater_dge (.Rmd) | Scanpy_dge (.ipynb) |
Trajectory inference | Slingshot_ti (.Rmd) | Slingshot_ti | PAGA_ti |
Environments being used in the course (see Conda instructions ):
The easiest way of getting started with the exercises is to download the .Rmd
/.ipynb
file and then open it with Rstudio / Jypyter Notebooks. First activate your conda environment, then copy the link of a .Rmd
/.ipynb
file and then type:
wget <LINK_TO_Exercise1.Rmd_FILE>
rstudio Exercise1.Rmd &
Or in python for .ipynb
:
wget <LINK_TO_Exercise1.ipynb_FILE>
jupyter notebook Exercise1.ipynb &
We highly recommend you to use the files provided instead of copying and pasting from the rendered report. Keep in mind that the results may vary slightly depending on the parameters used.
As you run into problems, we will try to fill in the FAQ with common questions.
Please note that the exercises listed below belong to past courses and might not be completely updated. Nonetheless, they provide even more details and options to analyse single cell data.
Name (link) | Description |
---|---|
Read-Pipeline | Snakemake pipeline for processing SmartSeq2 data, mapping reads, QC and expression estimates |
biomaRt | For those not familiar with working with biomaRt, we suggest that you have a look at this example code for how to convert between different formats using biomaRt |
PCA, tSNE and clustering | Basic PCA, tSNE and clustering using base R on mouse embryonic development data. |
KNN-graphs | Construction of graphs from cell-cell similiarities using igraph |
Estimating Batch-Effects | A tutorial for estimating genome-wide and individual genes batch-effects |
Normalization Comparison | A tutorial for comparison scRNAseq and bulk RNAseq normalisation strategies. |
SC3 package | Tutorial with the SC3 consensus clustering package |
Trajectory with Monocle2 | A tutorial with mouse embryonic data using the Monocle package for pseudotime analysis |
Differential expression_2 | OBS! This old tutorial uses Seurat v2! For this tutorial we have included several different methods for differential expression tests on single cell data, including SCDE, MAST, SC3 and Seurat. The exercise has been split into 2 parts with evaluation of all results in the second part |
UPPMAX Sbatch | One example of a sbatch script |
Pagoda | Pagoda pathway wPCA for clustering of cells. OBS! several steps in this tutorial takes hours to run if you work with your own dataset, a good suggestion is to start with the first steps, knn.error.model, pagoda.varnorm and pagoda.pathway.wPCA and let it run while working on other tutorials. You can also run it with more than one core to speed things up |
We will try to keep these tutorials up to date. If you find any errors or things that you think should be updated please contact Åsa (asa.bjorklund@scilifelab.se) or Paulo (paulo.czarnewski@scilifelab.se)