主成分分析
计算机科学
特征选择
支持向量机
人工智能
人工神经网络
遗传算法
Boosting(机器学习)
模式识别(心理学)
机器学习
数据挖掘
作者
Tuncay Özcan,Ebru Pekel Özmen
标识
DOI:10.1142/s0218213023400092
摘要
Cardiovascular diseases are one of the most common causes of death in the world. At this point, early diagnosis of heart diseases is critically important. The aim of this study is to predict the heart disease using feature selection, classification and optimization algorithms. Firstly, principal component analysis (PCA) is used to create the feature selection model and to determine the effective attributes. Then, Extreme Gradient Boosting (XGBoost) classification model is proposed to predict the heart disease. Finally, genetic algorithm (GA) is applied to optimize the parameters of XGBoost to improve the classification accuracy. The developed hybrid PCA-XGBoost-GA approach is compared with XGBoost, PCA-XGBoost, XGBoost-GA, artificial neural network (ANN) and support vector machine (SVM). The effectiveness of these approaches is illustrated with a case study with the actual data taken from a university hospital in Turkey. The numerical results show that the proposed PCA-XGBoost-GA model outperforms the other classification models in terms of accuracy rate, recall, precision and F-measure. Moreover, feature selection and parameter optimization improve the classification performance of the XGBoost model.
科研通智能强力驱动
Strongly Powered by AbleSci AI