Shotgun Metagenomics
2025-11-03
Chapter 1 Introduction
There are many unknown and unculturable microbes found in a vast array of different environments. Shotgun metagenomics is an approach to capture all the genetic information in a sample, providing the taxonomic and metabolic information of all present organisms. In this course we will analyse shotgun metagenomic sequencing data from stool samples to compare Western and Korean diets.
Sessions will start with a brief presentation followed by self-paced computer practicals guided by this online interactive book. This book will contain theory and practice code. Multiple choice questions will guide the interpretation of results.
At the end of the course learners will be able to:
- Carry out quality control of sequencing data, including host removal.
- Explain how Kraken2 utilises k-mers to taxonomically classify sequence data.
- Quantify taxonomic composition of shotgun metagenomic data with Kraken2 & Bracken.
- View taxonomic abundances with Krona.
- Utilise LEfSe for biomarker detection.
- Profile the microbial pathways in metagenomic sequencing data with the bioBakery suite of tools, including HUMAnN.
- Perform a metagenome assembly with MEGAHIT, and assess it with QUAST.
- Carry out genomic binning of the metagenome assembly with MetaBAT2 to try to separate the different species present into Metagenome Assembled Genomes (MAGs).
- Assess the quality, completeness, and contamination levels of MAGs with CoCoPyE.
- Functional annotation of the bins with Bakta.
Table of contents
| Overview | Raw data |
|
|
| Trimming data | Host removal |
|
|
| Taxonomic profiling | Functional profiling |
|
|
| Metagenome assembly | Binning |
|
|
| Functional annotation | Appendix |
|
|

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.









