随机森林
特征选择
朴素贝叶斯分类器
支持向量机
人工智能
计算机科学
逻辑回归
人工神经网络
特征(语言学)
统计分类
模式识别(心理学)
机器学习
选择(遗传算法)
数据挖掘
肝细胞癌
特征提取
生物
哲学
癌症研究
语言学
作者
Moh Abdul Latief,Titin Siswantining,Alhadi Bustamam,Devvi Sarwinda
标识
DOI:10.1109/icicos48119.2019.8982435
摘要
Hepatocellular carcinoma is one of the cancers that cause death in the world. We get hepatocellular carcinoma data in the form of microarray data gene expression obtained from the National Center for Biotechnology Information website consisting of 40 samples and 54675 features. The main purpose of this research is to compare the performance evaluation of Hepatocellular Carcinoma by applying feature selection to several classification algorithms. Random Forest feature selection method will be paired with several classification algorithms such as Support Vector Classification, Neural Network Classification, Random Forest, Logistic Regression, and Naïve Bayes. This study uses 5-fold cross-validation as an evaluation method. The results showed that Random Forest algorithm, Neural Network, Vector Machine Classification, and Naive Bayes show higher classification performance evaluation than without using random forest feature selection, while the Logistic Regression model provides a higher performance evaluation without using Random Forest feature selection. Support Vector Classification offers the highest performance evaluation compared to four other algorithms using feature selection, but Logistic Regression provides higher performance evaluation compared to different classification algorithms without feature selection.
科研通智能强力驱动
Strongly Powered by AbleSci AI