加权
公制(单位)
计算机科学
相关性(法律)
回归
均方误差
数据挖掘
人工智能
任务(项目管理)
性能指标
回归分析
机器学习
统计
算法
数学
医学
运营管理
管理
政治学
法学
经济
放射科
摘要
Abstract Many real‐world data mining applications involve using imbalanced datasets to obtain predictive models. Imbalanced data can hinder the model performance of learning algorithms in rare cases. Although there are many well‐researched classification task solutions, most of them cannot be directly applied to regression task. One of the challenges in imbalanced regression is to find a suitable evaluation and optimization standard that can improve the predictive ability of the model without severe model bias. Based on the importance of rare cases, this study proposes a new evaluation metric called adapted squared error relevance (ASER) by defining new relevance function and weighting functions. This metric weights data points by defining the importance of rare cases and assigns different weights to losses of the same size at different rare cases, thus enabling the model selected by this evaluation metric to better predict rare cases. ASER is compared with SER on 32 real datasets and 9 simulated datasets to verify the predictive performance of the selected model at rare cases. The experimental results show that the new evaluation metric ASER can obtain a high prediction performance at rare cases, while also not losing too much prediction accuracy in common cases.
科研通智能强力驱动
Strongly Powered by AbleSci AI