Abstract
Translation in eukaryotes does not always start at the first AUG in an mRNA, implying that context information also plays a role.This makes prediction of translation initiation sites a non-trivial task, especially when analysing EST and genome data where the entire mature mRNA sequence is not known. In this paper, we employ artificial neural networks to predict which AUG triplet in an mRNA sequence is the start codon. The trained networks correctly classified 88% of Arabidopsis and 85% of vertebrate AUG triplets. We find that our trained neural networks use a combination of local start codon context and global sequence information. Furthermore, analysis of false predictions shows that AUGs in frame with the actual start codon are more frequently selected than out-of-frame AUGs, suggesting that our nteworks use reading frame detection. A number of conflicts between neural network predictions and database annotations are analysed in detail, leading to identification of possible database errors.
Original language | English |
---|---|
Title of host publication | Proceedings Fifth International Conference on Intelligent Systems for Molecular Biology |
Place of Publication | Menlo Park, CA, USA |
Publisher | AAAI Press |
Publication date | 1997 |
Pages | 226-233 |
Publication status | Published - 1997 |
Event | 5th International Conference on Intelligent Systems for Molecular Biology - Halkidiki, Greece Duration: 21 Jun 1997 → 26 Jun 1997 Conference number: 5 http://www.aaai.org/Library/ISMB/ismb97contents.php |
Conference
Conference | 5th International Conference on Intelligent Systems for Molecular Biology |
---|---|
Number | 5 |
Country/Territory | Greece |
City | Halkidiki |
Period | 21/06/1997 → 26/06/1997 |
Internet address |