NanoMGT: Marker gene typing of low complexity mono-species metagenomic samples using noisy long reads

Malte B. Hallgren*, Philip T. L. C. Clausen, Frank M Aarestrup

*Corresponding author for this work

Research output: Contribution to journalJournal articleResearchpeer-review

17 Downloads (Orbit)

Abstract

Rapid advancements in sequencing technologies have led to significant progress in microbial genomics, yet challenges persist in accurately identifying microbial strain diversity in metagenomic samples, especially when working with noisy long-read data from platforms like Oxford Nanopore Technologies (ONT). In this article, we introduce NanoMGT, a tool designed to enhance marker gene typing in low-complexity mono-species samples, leveraging the unique properties of long reads. NanoMGT excels in its ability to accurately identify mutations amidst high error rates, ensuring the reliable detection of multiple strain-specific marker genes. Our tool implements a novel scoring system that rewards mutations co-occurring across different reads and penalizes densely grouped, likely erroneous variants, thereby achieving a good balance between sensitivity and precision. A comparative evaluation of NanoMGT, using a simulated multi-strain sample of seven bacterial species, demonstrated superior performance relative to existing tools and the advantages of using a threshold-based filtering approach to calling minority variants in ONT's sequencing data. NanoMGT's potential as a post-binning tool in metagenomic pipelines is particularly notable, enabling researchers to more accurately determine specific alleles and understand strain diversity in microbial communities. Our findings have significant implications for clinical diagnostics, environmental microbiology, and the broader field of genomics. The findings offer a reliable and efficient approach to marker gene typing in complex metagenomic samples.
Original languageEnglish
Article numberbpae057
JournalBiology Methods and Protocols
Volume9
Issue number1
Number of pages9
ISSN2396-8923
DOIs
Publication statusPublished - 2024

Keywords

  • Metagenomics
  • Marker gene typing
  • Long-read sequencing
  • Oxford Nanopore Technologies
  • Strain identification
  • Contamination
  • Variant calling
  • Microbial genomics
  • Bioinformatics tools

Fingerprint

Dive into the research topics of 'NanoMGT: Marker gene typing of low complexity mono-species metagenomic samples using noisy long reads'. Together they form a unique fingerprint.

Cite this