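Benchmark (surveying)
Discriminant
AdaBoost
Computer science
Binding site
Sequence (biology)
Metal
Chemistry
Algorithm
Machine learning
Artificial intelligence
Data mining
Pattern recognition (psychology)
Support vector machine
Biochemistry
Geodesy
Organic chemistry
Geography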
Authors
Liang Qiao, Dongqing Xie
Identifier
DOI:10.1016/j.ab.2018.11.009
Abstract
Accurately identifying metal ion-binding sites solely from protein sequences is valuable for both basic experimental biology and drug discovery studies. Although considerable progress has been made, metal ion-binding site prediction remains a challenging problem due to the small size and high versatility of metal ions. In this paper, we develop a ligand-specific predictor called MIonSite for predicting metal ion-binding sites from protein sequences. MIonSite first employs protein evolutionary information, predicted secondary structure, predicted solvent accessibility, and conservation information calculated by the Jensen-Shannon divergence score to extract discriminative features for each residue. An enhanced AdaBoost algorithm is then designed to cope with the severe class imbalance inherent in metal ion-binding site prediction, where non-binding sites far outnumber metal ion-binding sites. A new gold-standard benchmark dataset, consisting of training and independent validation subsets of Zn2+, Ca2+, Mg2+, Mn2+, Fe3+, Cu2+, Fe2+, Co2+, Na+, K+, Cd2+, and Ni2+, is constructed to compare the proposed MIonSite against existing predictors. Experimental results demonstrate that the proposed MIonSite achieves high prediction performance and outperforms other state-of-the-art sequence-based predictors. The standalone program of MIonSite and the corresponding datasets can be freely downloaded at https://github.com/LiangQiaoGu/MIonSite.git for academic use.
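The abstract names a Jensen-Shannon divergence score as the source of the conservation feature but does not give its exact form. The following is a minimal sketch of one common way to score a single alignment column, not the MIonSite implementation: the function names `js_divergence` and `column_conservation`, the uniform background distribution, and the choice to ignore gap characters are all assumptions made for illustration.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def js_divergence(p, q):
    """Jensen-Shannon divergence (log base 2) between two discrete distributions.

    With base-2 logarithms the value lies between 0 (identical) and 1 (disjoint).
    """
    def kl(a, b):
        # Terms with a probability of zero in `a` contribute nothing.
        return sum(x * math.log2(x / y) for x, y in zip(a, b) if x > 0)

    m = [(x + y) / 2 for x, y in zip(p, q)]
    return 0.5 * kl(p, m) + 0.5 * kl(q, m)


def column_conservation(column, background=None):
    """Score one alignment column against a background distribution.

    `column` is a string of residues taken from one position of a multiple
    sequence alignment. Characters outside the 20 standard amino acids
    (for example gaps) are ignored, which is an assumption of this sketch.
    """
    counts = Counter(c for c in column if c in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    p = [counts[a] / total for a in AMINO_ACIDS]
    # Assumed background: uniform over the 20 amino acids.
    q = background or [1 / len(AMINO_ACIDS)] * len(AMINO_ACIDS)
    return js_divergence(p, q)


if __name__ == "__main__":
    print(column_conservation("CCCCCCCC"))  # fully conserved column
    print(column_conservation("ACDEFGHI"))  # highly variable column
```

Under these assumptions a fully conserved column scores higher than a variable one, which is the property that makes the score usable as a per-residue feature. A real pipeline would more likely use an empirically derived background distribution and some form of sequence weighting.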