判别式
隐马尔可夫模型
拓扑(电路)
跨膜结构域
马尔可夫链
循环(图论)
计算机科学
算法
跨膜蛋白
螺旋(腹足类)
模式识别(心理学)
人工智能
数学
生物
机器学习
遗传学
组合数学
氨基酸
生态学
受体
蜗牛
作者
Erik L. L. Sonnhammer,Gunnar von Heijne,Anders Krogh
出处
期刊:PubMed
日期:1998-01-01
卷期号:6: 175-82
被引量:2501
摘要
A novel method to model and predict the location and orientation of alpha helices in membrane-spanning proteins is presented. It is based on a hidden Markov model (HMM) with an architecture that corresponds closely to the biological system. The model is cyclic with 7 types of states for helix core, helix caps on either side, loop on the cytoplasmic side, two loops for the non-cytoplasmic side, and a globular domain state in the middle of each loop. The two loop paths on the non-cytoplasmic side are used to model short and long loops separately, which corresponds biologically to the two known different membrane insertions mechanisms. The close mapping between the biological and computational states allows us to infer which parts of the model architecture are important to capture the information that encodes the membrane topology, and to gain a better understanding of the mechanisms and constraints involved. Models were estimated both by maximum likelihood and a discriminative method, and a method for reassignment of the membrane helix boundaries were developed. In a cross validated test on single sequences, our transmembrane HMM, TMHMM, correctly predicts the entire topology for 77% of the sequences in a standard dataset of 83 proteins with known topology. The same accuracy was achieved on a larger dataset of 160 proteins. These results compare favourably with existing methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI