Finding the most significant common sequence and structure motifs in a set of RNA sequences

Jan Gorodkin, L.J. Heyer, G.D. Stormo

Research output: Contribution to journalJournal articleResearchpeer-review

Abstract

We present a computational scheme to locally align a collection of RNA sequences using sequence and structure constraints, In addition, the method searches for the resulting alignments with the most significant common motifs, among all possible collections, The first part utilizes a simplified version of the Sankoff algorithm for simultaneous folding and alignment of RNA sequences, but maintains tractability by constructing multi-sequence alignments from pairwise comparisons, The algorithm finds the multiple alignments using a greedy approach and has similarities to both CLUSTAL and CONSENSUS, but the core algorithm assures that the pairwise alignments are optimized for both sequence and structure conservation. The choice of scoring system and the method of progressively constructing the final solution are important considerations that are discussed, Example solutions, and comparisons with other approaches, are provided, The solutions include finding consensus structures identical to published ones.
Original languageEnglish
JournalNucleic Acids Research
Volume25
Issue number18
Pages (from-to)3724-3732
ISSN0305-1048
DOIs
Publication statusPublished - 1997

Cite this