Default prediction based on a locally weighted dynamic ensemble model for imbalanced data
计算机科学
集合预报
数据挖掘
人工智能
作者
Xing Jin,Guotai China,Ancheng Pan
出处
期刊:The Journal of Risk Model Validation日期:2024-01-01
标识
DOI:10.21314/jrmv.2023.012
摘要
Default prediction plays a decisive role in the credit decisions of financial institutions. To avoid the bias that can occur in model predictions due to differences in the numbers of defaulting and nondefaulting firms, this study proposes a locally weighted dynamic ensemble model. To construct more diverse base classifiers, ten imbalanced data sampling methods and five heterogeneous classifiers are introduced to balanced bagging to select the base classifiers with the highest accuracy under different data distributions. To reduce overfitting and information loss, the locally weighted dynamic ensemble method is used to obtain the final prediction result. Experiments on three publicly available data sets and a data set of Chinese listed firms validate that the predictive performance of the proposed ensemble model is superior to that of three other heterogeneous ensemble models, seven homogeneous ensemble models and five individual models in predicting the imbalanced data. Moreover, the proposed ensemble model can predict financial institutions' default status five years ahead.