A hidden Markov model for predicting transmembrane helices in protein sequences

Erik L.L. Sonnhammer, Gunnar von Heijne, Anders Stærmose Krogh

    Research output: Chapter in Book/Report/Conference proceeding; Article in proceedings; Research; peer-review

    Abstract

    A novel method to model and predict the location and orientation of alpha helices in membrane-spanning proteins is presented. It is based on a hidden Markov model (HMM) with an architecture that corresponds closely to the biological system. The model is cyclic with 7 types of states for helix core, helix caps on either side, loop on the cytoplasmic side, two loops for the non-cytoplasmic side, and a globular domain state in the middle of each loop. The two loop paths on the non-cytoplasmic side are used to model short and long loops separately, which corresponds biologically to the two known different membrane insertion mechanisms. The close mapping between the biological and computational states allows us to infer which parts of the model architecture are important to capture the information that encodes the membrane topology, and to gain a better understanding of the mechanisms and constraints involved. Models were estimated both by maximum likelihood and a discriminative method, and a method for reassignment of the membrane helix boundaries was developed. In a cross-validated test on single sequences, our transmembrane HMM, TMHMM, correctly predicts the entire topology for 77% of the sequences in a standard dataset of 83 proteins with known topology. The same accuracy was achieved on a larger dataset of 160 proteins. These results compare favourably with existing methods.
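    The cyclic architecture described in the abstract can be illustrated with a much-reduced sketch. The example below is not TMHMM: it keeps only four states (cytoplasmic loop, helix going outward, non-cytoplasmic loop, helix going inward) and omits the helix caps, globular states, and the separate short and long loop paths. All state names, transition and emission probabilities, and the three residue classes are hypothetical values chosen for illustration. It decodes a topology string with standard Viterbi decoding.

```python
import math

# Toy cyclic topology (assumption, far simpler than the model in the abstract):
# inside loop -> helix (outward) -> outside loop -> helix (inward) -> inside loop
STATES = ["i", "M_out", "o", "M_in"]
LABEL = {"i": "i", "M_out": "M", "o": "o", "M_in": "M"}

# Hypothetical transition and start probabilities, not estimated from data.
TRANS = {
    "i": {"i": 0.9, "M_out": 0.1},
    "M_out": {"M_out": 0.9, "o": 0.1},
    "o": {"o": 0.9, "M_in": 0.1},
    "M_in": {"M_in": 0.9, "i": 0.1},
}
START = {"i": 0.5, "o": 0.5}

HYDROPHOBIC = set("AILMFVWC")
POSITIVE = set("KR")

# Hypothetical emission probabilities over three residue classes:
# (hydrophobic, positively charged, other).
EMIT = {
    "i": (0.2, 0.3, 0.5),
    "o": (0.25, 0.05, 0.7),
    "M_out": (0.8, 0.02, 0.18),
    "M_in": (0.8, 0.02, 0.18),
}


def emission(state, aa):
    """Return the toy emission probability of residue `aa` in `state`."""
    if aa in HYDROPHOBIC:
        cls = 0
    elif aa in POSITIVE:
        cls = 1
    else:
        cls = 2
    return EMIT[state][cls]


def viterbi(seq):
    """Return the most probable label string ('i', 'M', 'o') for `seq`."""
    if not seq:
        return ""
    neg_inf = float("-inf")
    score = {
        s: (math.log(START[s]) if s in START else neg_inf)
        + math.log(emission(s, seq[0]))
        for s in STATES
    }
    back = []  # one dict of back-pointers per position after the first
    for aa in seq[1:]:
        new_score, pointers = {}, {}
        for s in STATES:
            best_prev, best = None, neg_inf
            for p in STATES:
                t = TRANS[p].get(s)
                if t is None or score[p] == neg_inf:
                    continue  # transition not allowed or predecessor unreachable
                cand = score[p] + math.log(t)
                if cand > best:
                    best, best_prev = cand, p
            pointers[s] = best_prev
            new_score[s] = (
                best + math.log(emission(s, aa)) if best_prev is not None else neg_inf
            )
        back.append(pointers)
        score = new_score
    # Trace back from the best final state.
    state = max(STATES, key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return "".join(LABEL[s] for s in path)


if __name__ == "__main__":
    # Made-up sequence: charged stretch, hydrophobic stretch, polar stretch.
    print(viterbi("KKRKK" + "L" * 20 + "SSSSS"))
```

    Because the transitions only allow movement around the cycle, any decoded path alternates between the two sides of the membrane, which is how a cyclic model of this kind encodes orientation as well as helix location.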
    Original language: English
    Title of host publication: Proceedings of the Sixth International Conference on Intelligent Systems for Molecular Biology
    Place of Publication: Menlo Park
    Publisher: AAAI Press
    Publication date: 1998
    Pages: 175-18
    Publication status: Published - 1998
    Event: Sixth International Conference on Intelligent Systems for Molecular Biology - Montreal, Canada
    Duration: 28 Jun 1998 - 1 Jul 1998
    Conference number: 6
    https://web.archive.org/web/20140223112627/http://www-lbit.iro.umontreal.ca/ISMB98/

    Conference

    Conference: Sixth International Conference on Intelligent Systems for Molecular Biology
    Number: 6
    Country/Territory: Canada
    City: Montreal
    Period: 28/06/1998 - 01/07/1998