Spatial transcriptomics (ST) allows for visualization and quantitative analysis of gene expression in tissue sections. There is a multitude of recent methods for ST, but here we will focus on data from the Visium platform.
Spatial transcriptomics data generated with the Visium platform is in many ways similar to single cell transcriptomics (SC). It contains UMI counts from about 5-20 cells per spot (spatial capture location) instead of single cells, meaning that the expression profile of a spot represents a mix of transcripts originating from all cells contributing to said spot. Despite several cells contributing to each spot, ST data - just as SC data - is still quite sparse. The major difference between SC and Visum data, aside from the latter not operating at single cell resolution, is that each observation (spot) has spatial information associated with it, i.e., we know where in the tissue the captured transcripts come from.
As mentioned, there are far more spatial transcriptomics techniques than just Visium, and new ones are continuously published, hence it's fair to say that this is a rapidly evolving field. Simultaneous to this introduction of new experimental techniques, we've seen a surge in new computational tools and methods for analysis of ST data. In this course we aim to make you familiar with a few of them.
Each task will be done in a group of four students, but many of the steps involves testing different tools or different analysis parameters. So we suggest that you subdivide the tasks within your group as you see fit.
In both projects we will work with Visium mouse brain sections, but each group will be focusing on different objectives.
In this project you will work with mouse brain sections from Kleshchevnikov et al. which can be downloaded
here
or using wget
:
wget https://cell2location.cog.sanger.ac.uk/tutorial/mouse_brain_visium_wo_cloupe_data.zip
alternatively curl
:
curl https://cell2location.cog.sanger.ac.uk/tutorial/mouse_brain_visium_wo_cloupe_data.zip -O mouse_brain_visium_wo_cloupe_data.zip
In this project you will focus on the integration of single cell data with spatial data from the primary visual cortex (VISp) region, a well-studied part of the cortex. What makes this structure interesting to study from a spatial perspective is that it exhibits a form of layering where different types of neurons are arranged in six laminar layers (Layer I to VI), see image below:
If your are unfamiliar with the mouse brain or just want some guidance when orienting your tissues, we highly recommend the Allen Brain Atlas' interactive reference, which you may find here.
The single cell data we'll be using was originally published by Tasic et al. and can be accessed
here. In order
to save you some time, we've generated a "ready-to-use" h5ad
file for you, containing both the count and meta data,
which you can download here. If you are interested, the script we used to
create this h5ad
file is located in scripts/sc_make_h5ad.py
. h5ad
files can
easily be loaded in python
using the anndata
or scanpy
packages, there's
also support for these files in R
for example using the readH5AD
function
from the zellkonverter
package.
We have provided conda recipies for some of the most common tools that you may want to use in the projects in this folder
If you are not familiar with conda, please have a look at the
Precourse_material. There are separate environments for R
and
python
, where we suggest you use whatever language you feel most comfortable
with; please note that you do not have to limit yourself to one language per
group, in fact it's a good exercise to try to coordinate data and results
between different suites and languages.
Please refer to our Tips and tricks where we try to list common issues with some of the most common tools.