Skip to content

Module 3: Omics Analysis

This module marks the transition from computational literacy to biological discovery. After Linux fundamentals (Module 1) and environment management (Module 2), you now apply these skills to high-throughput sequencing data and complete end-to-end analysis workflows.

By the end of this module, you will be able to:

  • Explain the omics hierarchy and flow of information from genotype to phenotype
  • Interpret the core sequencing file formats used in analysis workflows
  • Run reproducible quality-control workflows with FastQC and MultiQC
  • Compare de novo and reference-based assembly strategies
  • Execute long-read de novo and reference-guided assembly pipelines
  • Lesson 1: The omics hierarchy and file architecture
    • Covers the biological omics layers and the FASTQ/FASTA/BAM/VCF data model
  • Lesson 2: Precision quality control (QC)
    • Introduces Phred scoring, FastQC, and MultiQC interpretation
  • Lesson 3: Genome assembly foundations
    • Explains assembly concepts, coverage, repeats, and tool choices
  • Lesson 4: De novo assembly with long reads
    • Walks through host-read filtering, assembly with Flye, and BLAST annotation
  • Lesson 5: Reference-based assembly
    • Covers mapping to ACMV references, QC metrics, and consensus generation

Note: Use your bioinfo environment for tool-heavy practical lessons.