琥珀酰化
过度拟合
计算机科学
算法
选择(遗传算法)
线性判别分析
人工智能
序列(生物学)
模式识别(心理学)
生物系统
生物
赖氨酸
生物化学
氨基酸
人工神经网络
作者
Yixiao Xia,Minchao Jiang,Yizhang Luo,Guanwen Feng,Gangyong Jia,Hua Zhang,Pu Wang,Ruiquan Ge
标识
DOI:10.1089/cmb.2022.0109
摘要
Protein succinylation is a novel type of post-translational modification in recent decade years. It played an important role in biological structure and functions verified by experiments. However, it is time consuming and laborious for the wet experimental identification of succinylation sites. Traditional technology cannot adapt to the rapid growth of the biological sequence data sets. In this study, a new computational method named SuccSPred2.0 was proposed to identify succinylation sites in the protein sequences based on multifeature fusion and maximal information coefficient (MIC) method. SuccSPred2.0 was implemented based on a two-step strategy. At first, high-dimension features were reduced by linear discriminant analysis to prevent overfitting. Subsequently, MIC method was employed to select the important features binding classifiers to predict succinylation sites. From the compared experiments on 10-fold cross-validation and independent test data sets, SuccSPred2.0 obtained promising improvements. Comparative experiments showed that SuccSPred2.0 was superior to previous tools in identifying succinylation sites in the given proteins.
科研通智能强力驱动
Strongly Powered by AbleSci AI