Welcome to Friday! Almost there…

In your group discuss:


Thursday’s questions

  1. Why should you use a workflow manager for your analysis instead of manually running tools?
  1. More accurate results
  2. Better documentation of results
  3. To avoid accidental mistakes
  4. Better reproducibility / portability of analysis
  1. What is the difference between single hyphens and double hyphens in a Nextflow command?
  1. There is no difference
  2. Single hyphens denote core Nextflow options, double are pipeline parameters
  3. Double hyphens denote core Nextflow options, single are pipeline parameters
  4. Depends on the pipeline
  1. Which two commands are useful for re-running a failed pipeline?
  1. nextflow pull
  2. nextflow log
  3. nextflow run -name
  4. nextflow run -resume
  1. Which peak caller does the nf-core/chipseq pipeline use?
  1. MACS2
  2. Genrich
  3. SEACR
  4. All of the above
  1. What was the name given to the new, modular language implemented within Nextflow
  1. Nextflow 2.0
  2. Nextflow modules
  3. DSL2
  4. nf-core/modules
  1. Which QC metric is not generated by the nf-core/chipseq pipeline?
  1. Strand cross-correlation plots
  2. Normalized strand coefficient & Relative strand correlation ratios
  3. TSS enrichment plots
  4. Gene count suitability plots
  1. Why does the nf-core/atacseq pipeline merge peaks sets across technical and biological replicates?
  1. To find a consensus set of peaks for downstream analysis
  2. To get give the best genome coverage possible
  3. To reduce background noise in the data
  4. To make the pipeline run faster
  1. Why are mitochondrial reads removed in the nf-core/atacseq pipeline?
  1. To get a better idea of the genome-wide duplication rate
  2. To avoid bias in downstream analysis
  3. Because some tissues and cell types can give libraries with a large fraction of mito reads
  4. All of the above