膜拓扑
秀丽隐杆线虫
跨膜蛋白
膜蛋白
网络拓扑
计算生物学
跨膜结构域
马尔可夫链
隐马尔可夫模型
拓扑(电路)
整体膜蛋白
编码
生物
基因组
生物系统
计算机科学
遗传学
基因
膜
人工智能
数学
机器学习
受体
组合数学
操作系统
作者
Anders Krogh,B. Larsson,Gunnar von Heijne,Erik L. L. Sonnhammer
标识
DOI:10.1006/jmbi.2000.4315
摘要
We describe and validate a new membrane protein topology prediction method, TMHMM, based on a hidden Markov model. We present a detailed analysis of TMHMM's performance, and show that it correctly predicts 97-98 % of the transmembrane helices. Additionally, TMHMM can discriminate between soluble and membrane proteins with both specificity and sensitivity better than 99 %, although the accuracy drops when signal peptides are present. This high degree of accuracy allowed us to predict reliably integral membrane proteins in a large collection of genomes. Based on these predictions, we estimate that 20-30 % of all genes in most genomes encode membrane proteins, which is in agreement with previous estimates. We further discovered that proteins with N(in)-C(in) topologies are strongly preferred in all examined organisms, except Caenorhabditis elegans, where the large number of 7TM receptors increases the counts for N(out)-C(in) topologies. We discuss the possible relevance of this finding for our understanding of membrane protein assembly mechanisms. A TMHMM prediction service is available at http://www.cbs.dtu.dk/services/TMHMM/.
科研通智能强力驱动
Strongly Powered by AbleSci AI