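Chemistry
Jackknife resampling
Feature (linguistics)
Support vector machine
Dipeptide
Enzyme
Biological system
Artificial intelligence
Biochemistry
Amino acid
Mathematics
Statistics
Computer science
Estimator
Philosophy
Biology
Linguistics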
Authors
Xianfang Wang,Hongfei Li,Peng Gao,Yifeng Liu,Wenjing Zeng
Identifier
DOI:10.2174/1570178615666180925125912
Abstract
The catalytic activity of an enzyme differs from that of an inorganic catalyst: at high temperature, or in an excessively acidic or alkaline environment, the structure of the enzyme is destroyed and it loses its activity. Although biochemical experiments can measure the optimal pH of an enzyme, these methods are inefficient and costly. To address this, a computational model can be established to determine whether the optimal environment of an enzyme is acidic or alkaline. In this paper, we first introduce a new feature, called dual g-gap dipeptide composition, to formulate enzyme samples. The best features are then selected using the F value calculated from analysis of variance. Finally, a support vector machine is used to build a prediction model for distinguishing acidic from alkaline enzymes. An overall accuracy of 95.9% was achieved with jackknife cross-validation, which indicates that our method is effective and efficient for predicting acidic and alkaline enzymes. The feature proposed in this paper could also be applied in other fields of bioinformatics.
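The feature-extraction and feature-ranking steps named in the abstract can be illustrated with a minimal sketch. The abstract does not define the "dual" variant, so the code below assumes the ordinary g-gap dipeptide composition (the frequency of residue pairs separated by g intervening positions) and a standard one-way ANOVA F statistic computed per feature. The function names `g_gap_dipeptide_composition` and `anova_f_value` are hypothetical, and this is not the authors' implementation.

```python
from itertools import product

# The 20 standard amino acids and the 400 ordered residue pairs they form.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
PAIRS = ["".join(p) for p in product(AMINO_ACIDS, repeat=2)]
PAIR_INDEX = {pair: i for i, pair in enumerate(PAIRS)}


def g_gap_dipeptide_composition(sequence, g):
    """Return a 400-dim vector of frequencies of residue pairs with g residues between them.

    Assumption: plain g-gap composition; the paper's "dual" variant is not
    specified in the abstract and is not reproduced here.
    """
    counts = [0] * len(PAIRS)
    total = 0
    for i in range(len(sequence) - g - 1):
        pair = sequence[i] + sequence[i + g + 1]
        idx = PAIR_INDEX.get(pair)
        if idx is not None:  # skip pairs containing non-standard residues
            counts[idx] += 1
            total += 1
    if total == 0:
        return [0.0] * len(PAIRS)
    return [c / total for c in counts]


def anova_f_value(groups):
    """One-way ANOVA F statistic for a single feature.

    `groups` holds one list of feature values per class
    (e.g. acidic enzymes and alkaline enzymes).
    """
    n_total = sum(len(g) for g in groups)
    k = len(groups)
    grand_mean = sum(sum(g) for g in groups) / n_total
    means = [sum(g) / len(g) for g in groups]
    ss_between = sum(len(g) * (m - grand_mean) ** 2 for g, m in zip(groups, means))
    ss_within = sum((x - m) ** 2 for g, m in zip(groups, means) for x in g)
    ms_between = ss_between / (k - 1)
    ms_within = ss_within / (n_total - k)
    return ms_between / ms_within if ms_within > 0 else float("inf")
```

In a pipeline of the kind the abstract describes, each enzyme sequence would be mapped to such a vector, features would be ranked by their F value, and the top-ranked subset would be passed to a support vector machine evaluated by jackknife (leave-one-out) testing. Those last two steps are omitted here because they depend on library and parameter choices the abstract does not state.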