Skip to main navigation Skip to search Skip to main content

Adaptive sequence alignment for metagenomic data analysis

  • Sami Pietilä
  • , Tomi Suomi
  • , Niklas Paulin
  • , Asta Laiho
  • , Yannes S. Sclivagnotis
  • , Laura L. Elo*
  • *Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

30 Downloads (Pure)

Abstract

With advances in sequencing technologies, the use of high-throughput sequencing to characterize microbial communities is becoming increasingly feasible. However, metagenomic assembly poses computational challenges in reconstructing genes and organisms from complex samples. To address this issue, we introduce a new concept called Adaptive Sequence Alignment (ASA) for analyzing metagenomic DNA sequence data. By iteratively adapting a set of partial alignments of reference sequences to match the sample data, the approach can be applied in multiple scenarios, from taxonomic identification to assembly of target regions of interest. To demonstrate the benefits of ASA, we present two application scenarios and compare the results with state-of-the-art methods conventionally used for the same tasks. In the first, ASA accurately detected microorganisms from a sequenced metagenomic sample with a known composition. The second illustrated the utility of ASA in assembling target genetic regions of the microorganisms. An example implementation of the ASA concept is available at https://github.com/elolab/ASA.

Original languageEnglish
Article number109743
JournalComputers in Biology and Medicine
Volume186
DOIs
Publication statusPublished - Mar 2025
MoE publication typeA1 Journal article-refereed

Funding

Prof. Elo reports grants from the European Union's Horizon 2020 research and innovation programme (955321), Academy of Finland (310561, 314443, 329278, 335434, 335611 and 341342), and Sigrid Juselius Foundation, during the conduct of the study. Our research is also supported by Biocenter Finland, and ELIXIR Finland. The authors wish to thank Olli Uhlgren for Graphical Abstract.

Keywords

  • Metagenomics
  • Sequence alignment
  • Sequence assembly
  • Taxonomic identification

Fingerprint

Dive into the research topics of 'Adaptive sequence alignment for metagenomic data analysis'. Together they form a unique fingerprint.

Cite this