Module 3: Omics Analysis
This module marks the transition from computational literacy to biological discovery. After Linux fundamentals (Module 1) and environment management (Module 2), you now apply these skills to high-throughput sequencing data and complete end-to-end analysis workflows.
Learning objectives
Section titled “Learning objectives”By the end of this module, you will be able to:
- Explain the omics hierarchy and flow of information from genotype to phenotype
- Interpret the core sequencing file formats used in analysis workflows
- Run reproducible quality-control workflows with FastQC and MultiQC
- Compare de novo and reference-based assembly strategies
- Execute long-read de novo and reference-guided assembly pipelines
Lessons
Section titled “Lessons”- Lesson 1: The omics hierarchy and file architecture
- Covers the biological omics layers and the FASTQ/FASTA/BAM/VCF data model
- Lesson 2: Precision quality control (QC)
- Introduces Phred scoring, FastQC, and MultiQC interpretation
- Lesson 3: Genome assembly foundations
- Explains assembly concepts, coverage, repeats, and tool choices
- Lesson 4: De novo assembly with long reads
- Walks through host-read filtering, assembly with Flye, and BLAST annotation
- Lesson 5: Reference-based assembly
- Covers mapping to ACMV references, QC metrics, and consensus generation
Note: Use your
bioinfoenvironment for tool-heavy practical lessons.