Abstract
A hidden Markov model of signal peptides has been developed. It
contains submodels for the N-terminal part, the hydrophobic region
and the region around the cleavage site. For known signal
peptides, the model can be used to assign objective boundaries
between these three regions. Applied to our data, the length
distributions for the three regions are significantly different
from expectations. For instance, the assigned hydrophobic region
is between 8 and 12 residues long in almost all eukaryotic signal
peptides. This analysis also makes obvious the difference between
eukaryotes, Gram-positive bacteria, and Gram-negative bacteria.
The model can be used to predict the location of the cleavage
site, which it finds correctly in nearly 70% of signal peptides in
a cross-validated test - almost the same accuracy as the best
previous method. One of the problems for existing prediction
methods is the poor discrimination between signal peptides and
uncleaved signal anchors, but this is substantially improved by
the hidden Markov model when expanding it with a very simple
signal anchor model.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the Sixth International Conference on Intelligent Systems for Molecular Biology |
| Place of Publication | Menlo Park |
| Publisher | AAAI Press |
| Publication date | 1998 |
| Pages | 122-130 |
| Publication status | Published - 1998 |
| Event | Sixth International Conference on Intelligent Systems for Molecular Biology - Montreal, Canada Duration: 28 Jun 1998 → 1 Jul 1998 Conference number: 6 https://web.archive.org/web/20140223112627/http://www-lbit.iro.umontreal.ca/ISMB98/ |
Conference
| Conference | Sixth International Conference on Intelligent Systems for Molecular Biology |
|---|---|
| Number | 6 |
| Country/Territory | Canada |
| City | Montreal |
| Period | 28/06/1998 → 01/07/1998 |
| Internet address |
Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver