Start by having a look at the Tutorials for analysis of ST data and integration with SC data using Seurat, Scran or Squidpy.
Select wichever pipeline you feel is most relevant for you to use.
1.1. Load the relevant ST data for your project into the data object of your choice (AnnData, SeuratObject, SCE etc.).
1.2. Have a look at the object and describe in your report briefly what the different slots of the objects are.
1.3 Concatenate the different sections into one object if relevant for that data type.
2.1 Calculate quality metrics such as number of genes, UMIs, and mitochondrial reads per spot.
2.2 Visualize the QC stats on a per spot basis, do you see even distribution across spots/sections?
2.3 Possibly filter out low quality spots, discuss with your group and motivate how the data should be filtered.
2.4 Filter genes, we suggest to always remove mitochondrial genes, but possibly also other problematic genes (e.g., ribosomal and Hb-genes).
3.1 Select a relevant set of genes for dimensionality reduction, motivate why you think this is a relevant choice.
3.2 Start with PCA and make a reasonable selection of number of principal components.
3.3 Run a graph based dimensionality reduction, like UMAP or another method of choice.
3.4 Cluster the spots with your method of choice, motivate why you are using that method.
4.1 Identify the area of interest (VISp) region and create a new object with only the selected region from 2 sections of your choice. Discuss with your group what region to select so that everyone uses the same.
4.2 Visualize the region
4.3 Rerun dimensionality reduction and clustering with the selected region. Does it change the clustering to what you had with the full dataset? Discuss why.
5.1 Perform clustering and QC of the data and compare to the annotations provided by the paper. Does it agree well do you think?
5.2 Perform Differential Expression analysis, this may be a relevant gene set to use for aligning the SC data to the ST data.
6.1 Look at existing methods for aligning SC and ST data and select a few that you think are relavant. Discuss with the group what to select and why.
6.2 Select features for integration/deconvolution, this can be done using variable genes in SC data only, both datasets or DEGs from SC data. Discuss what you think is the best method to use and possibly try out a few different ones if time allows.
6.3 Run the methods you selected.
6.4 Compare the results, what are main similiarities/differeces between the tools?
If time allows we have listed a few additional things you could try out if time allows.