计算机科学
范畴变量
粒子群优化
Boosting(机器学习)
机器学习
适应度函数
人工智能
二元分类
功能(生物学)
任务(项目管理)
支持向量机
遗传算法
管理
进化生物学
经济
生物
作者
Zongwen Fan,Jianping Gou,Cheng Wang
标识
DOI:10.1016/j.engappai.2023.106649
摘要
Knowing the potential students who will fail the final exam at early stages is very challenging but important for the decision-makers in the educational institutions to take proper actions to prevent them from failure. To accurately predict the secondary school student performance, we propose a double particle swarm optimization (PSO)-based categorical boosting (P2CatBoost) model based on the demographic, school period grades, and social/school related features. Considering the machine learning models are sensitive to their hyper-parameter settings, we introduce the PSO to optimize the fitness function. In addition, the threshold of a standard binary classification task is 0.5, which might not be the optimal value in real-world applications. Thus, we optimize this threshold by the PSO. To evaluate the performance of our proposed model, two datasets downloaded from the University of California, Irvine Repository and the Kaggle, respectively, are used. The experimental results showed that our proposed P2CatBoost has the best performance in terms of all the metrics used. Our proposed P2CatBoost has the best accuracy of 96.62% and 94.45% for the final grade prediction of the Mathematics and Portuguese courses, respectively. In addition, our proposed model outperforms the other models under comparison from 4.5% to 8.3%. The statistical analyses verify that our P2CatBoost can significantly outperform the comparing models. These results confirm the effectiveness of our double PSO for improving the performance of student performance prediction, indicating our proposed model could be a useful tool in educational institutions to improve the quality of education.
科研通智能强力驱动
Strongly Powered by AbleSci AI