Benchmarking software tools for trimming adapters and merging next-generation sequencing data for ancient DNA

Annette Lien, Leonardo Pestana Legori, Louis Kraft, Peter Wad Sackett, Gabriel Renaud*

*Corresponding author for this work

Research output: Contribution to journalJournal articleResearchpeer-review

40 Downloads (Pure)

Abstract

Ancient DNA is highly degraded, resulting in very short sequences. Reads generated with modern high-throughput sequencing machines are generally longer than ancient DNA molecules, therefore the reads often contain some portion of the sequencing adaptors. It is crucial to remove those adaptors, as they can interfere with downstream analysis. Furthermore, overlapping portions when DNA has been read forward and backward (paired-end) can be merged to correct sequencing errors and improve read quality. Several tools have been developed for adapter trimming and read merging, however, no one has attempted to evaluate their accuracy and evaluate their potential impact on downstream analyses. Through the simulation of sequencing data, seven commonly used tools were analyzed in their ability to reconstruct ancient DNA sequences through read merging. The analyzed tools exhibit notable differences in their abilities to correct sequence errors and identify the correct read overlap, but the most substantial difference is observed in their ability to calculate quality scores for merged bases. Selecting the most appropriate tool for a given project depends on several factors, although some tools such as fastp have some shortcomings, whereas others like leeHom outperform the other tools in most aspects. While the choice of tool did not result in a measurable difference when analyzing population genetics using principal component analysis, it is important to note that downstream analyses that are sensitive to wrongly merged reads or that rely on quality scores can be significantly impacted by the choice of tool.

Original languageEnglish
Article number1260486
JournalFrontiers in Bioinformatics
Volume3
Number of pages18
ISSN2673-7647
DOIs
Publication statusPublished - 2023

Keywords

  • Adapter trimming
  • Ancient DNA
  • Benchmarking
  • Next-generation sequencing
  • Read merging
  • Read processing

Fingerprint

Dive into the research topics of 'Benchmarking software tools for trimming adapters and merging next-generation sequencing data for ancient DNA'. Together they form a unique fingerprint.

Cite this