多项式logistic回归
分层抽样
计算机科学
简单随机抽样
逻辑回归
多项式分布
采样(信号处理)
数据建模
统计
数据挖掘
机器学习
随机森林
大数据
人工智能
数学
人口
人口学
滤波器(信号处理)
数据库
社会学
计算机视觉
作者
Rungruttikarn Moungmai
标识
DOI:10.1109/ecti-con54298.2022.9795600
摘要
Big data and data analysis are very necessary and play important role in the operation of organizations especially the future forecasting. To improve the predictive accuracy, an imbalanced data problem was considered before establishing a proper multinomial logistic regression model. Dealing with such problem, Random over sampling, Random under sampling, and Hybrid approaches were considered. Moreover, Simple random sampling and Stratified sampling techniques were applied in the data modification stage. The results revealed that the models fitting information of both imbalanced and balanced data models were similar and the model accuracies were around 70%. Although the accuracies were not different, the predictive accuracy and the precision of the minor group were increased.
科研通智能强力驱动
Strongly Powered by AbleSci AI