支持向量机
阿达布思
剪接
计算机科学
模式识别(心理学)
人工智能
Boosting(机器学习)
编码(内存)
分类器(UML)
机器学习
基因
生物
遗传学
作者
Elham Pashaei,Alper Yılmaz,Mustafa Özen,Nizamettin Aydin
标识
DOI:10.1109/smc.2016.7844835
摘要
The Biological sequence data are increasing rapidly, so there is a vital need of effective method for gene detection. Predicting of splice site is an important part of gene finding. Therefore, attempts to improve the prediction accuracy of the computational methods for splice sites detection continue. In this paper we propose a hybrid algorithm for splice sites prediction by combining AdaBoost classifier with a novel nucleotide encoding method, namely FDDM. Our encoding method provides frequency difference between the true sites and false sites (FD) along with distance measure (DM). The proposed method produces an improvement in comparison with the result of current methods such as MM1-SVM, Reduced MM1-SVM, SVM-B, LVMM, DM-SVM, DM2-AdaBoost and MSC+Pos(+APR)-SVM, when applied to the HS3D dataset with repeated 10-fold cross validation. In addition, for demonstrating the stability of the method, we also applied it to NN269 dataset. The obtained results indicate that the new method is practicable and efficient.
科研通智能强力驱动
Strongly Powered by AbleSci AI