This workshop is aimed towards biologists, researchers, computer scientists or data analysts with limited experience in analysing NGS data.
These are steps to be completed before the workshop.
Remote computing clusters (UPPMAX and NSC) will be use for data analyses. A SUPR/SNIC account is needed to use these resources.
If you do not already have one, create an account at https://supr.snic.se/.
Log in to SUPR/SNIC and request membership to the project IDs g2021013 and snic2021-22-101.
Once you are accepted to a project, you should see that project listed under your active projects.
Finally you need to request a login account to NSC. Login to SUPR and go to the Accounts page. Under the Possible Resource Account Requests heading click on Request Account on Tetralith @ NSC button and confirm it on the next page.
Checking your request and approving your account requires some manual work, so you might have to wait for some time (up to a working day) before the next step. When the account is ready to be created, you will receive an email to your registered email address (shown in your SUPR contact information) with information on how to proceed. You will get a URL that you use to choose the password (within seven days). When that has been done, the account ready for use within 15 minutes and you can then login using your chosen password.
Note: You will get one username & password for the account on UPPMAX, and one username and password for the account on NSC. Please keep track of both, we will tell you when to use which account during the workshop.
You need a program to connect to a remote cluster (UPPMAX and NSC). Linux and Mac users already have terminal on their systems. If you are on a Windows system, we recommend MobaXterm. It is recommended that you INSTALL the program and not use the portable version. MobaXterm also has an integrated SFTP file browser.
Mac users will need to download and install XQuartz for X11 forwarding. ie; to forward remotely opened windows to local machine.
When you need to transfer data between the remote cluster and your computer, you can use the tools SCP or SFTP through the terminal. Windows users can use the SFTP browser available with MobaXterm. If you prefer a GUI to upload and download files from the remote cluster, we recommend installing FileZilla.
For this step, you will have to use the terminal a bit. You can get started by following Tutorial One at this link Unix tutorial for beginners. You can use https://scilifelab.github.io/courses/ngsintro/common/emu/ (or this mirror) to try the commands in the tutorial, so that you don’t mess up any real world system. If you any questions regarding this tutorial contact: martin.dahlo [at] scilifelab.uu.se.
Connect to UPPMAX: Open the terminal (Windows users can use MobaXterm) and type ssh -Y user@rackham.uppmax.uu.se
, then enter your password. The password will not be visible as you type.
Create a user folder: Go to /proj/g2021013/nobackup/
and create a directory with your username. For example mkdir jody
. You will work inside this directory for the workshop. For example /proj/g2021013/nobackup/jody
. If you cannot write to the folder, the most likely reason is that you have not requested access to the workshop project via SUPR. This is described in step 1 above.
Note: It may take an hour or so from request approval, before you can actually write to the folder. We will check before the workshop that all students have logged in and done this, so do not forget!
Please install IGV on your local system before the start of the workshop.
Download IGV (Integrated Genome Browser) from the Broad Institute on your own computer and have the mouse genome (mm10) as well as the human genome (hg19) available.
The syllabus for this workshop are as follows.
After this workshop you should be able to: