The coefficient of determination R-squared is more informative than SMAPE, MAE, MAPE, MSE and RMSE in regression analysis evaluation

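Mean squared error; Statistics; Mathematics; Regression analysis; Mean absolute percentage error; Metric (unit); Regression; Range (aeronautics); Coefficient of determination; Linear regression; Ground truth; Binary number; Computer science; Artificial intelligence; Arithmetic; Operations management; Materials science; Economics; Composite material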
Authors
Davide Chicco,Matthijs J. Warrens,Giuseppe Jurman
Source
Journal: PeerJ [PeerJ, Inc.]
Volume/issue: 7: e623 Citations: 4703
Identifier
DOI:10.7717/peerj-cs.623
Abstract

Regression analysis makes up a large part of supervised machine learning, and consists of the prediction of a continuous independent target from a set of other predictor variables. The difference between binary classification and regression is in the target range: in binary classification, the target can have only two values (usually encoded as 0 and 1), while in regression the target can have multiple values. Even if regression analysis has been employed in a huge number of machine learning studies, no consensus has been reached on a single, unified, standard metric to assess the results of the regression itself. Many studies employ the mean square error (MSE) and its rooted variant (RMSE), or the mean absolute error (MAE) and its percentage variant (MAPE). Although useful, these rates share a common drawback: since their values can range between zero and +infinity, a single value of them does not say much about the performance of the regression with respect to the distribution of the ground truth elements. In this study, we focus on two rates that actually generate a high score only if the majority of the elements of a ground truth group has been correctly predicted: the coefficient of determination (also known as R-squared or R²) and the symmetric mean absolute percentage error (SMAPE). After showing their mathematical properties, we report a comparison between R² and SMAPE in several use cases and in two real medical scenarios. Our results demonstrate that the coefficient of determination (R-squared) is more informative and truthful than SMAPE, and does not have the interpretability limitations of MSE, RMSE, MAE and MAPE. We therefore suggest the usage of R-squared as standard metric to evaluate regression analyses in any scientific domain.
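
The six rates named in the abstract can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions, not the authors' code: the function name `regression_metrics` is hypothetical, MAPE and SMAPE are returned as fractions rather than percentages, and SMAPE uses the common form with the mean of |actual| and |predicted| in the denominator (other definitions exist).

```python
import numpy as np

def regression_metrics(y_true, y_pred):
    """Return MSE, RMSE, MAE, MAPE, SMAPE and R-squared for two equal-length sequences.

    Hypothetical helper for illustration only.
    Assumes y_true contains no zeros (MAPE) and is not constant (R-squared).
    """
    y_true = np.asarray(y_true, dtype=float)
    y_pred = np.asarray(y_pred, dtype=float)
    err = y_true - y_pred

    mse = np.mean(err ** 2)                  # range [0, +inf)
    rmse = np.sqrt(mse)                      # range [0, +inf)
    mae = np.mean(np.abs(err))               # range [0, +inf)
    mape = np.mean(np.abs(err / y_true))     # range [0, +inf), as a fraction

    # SMAPE as a fraction; with this definition it lies in [0, 2].
    smape = np.mean(np.abs(err) / ((np.abs(y_true) + np.abs(y_pred)) / 2))

    # R-squared: 1 minus residual sum of squares over total sum of squares.
    ss_res = np.sum(err ** 2)
    ss_tot = np.sum((y_true - y_true.mean()) ** 2)
    r2 = 1.0 - ss_res / ss_tot               # range (-inf, 1]

    return {"MSE": mse, "RMSE": rmse, "MAE": mae,
            "MAPE": mape, "SMAPE": smape, "R2": r2}

if __name__ == "__main__":
    actual = [1.0, 2.0, 3.0, 4.0, 5.0]
    predicted = [1.1, 1.9, 3.2, 3.8, 5.0]
    for name, value in regression_metrics(actual, predicted).items():
        print(f"{name}: {value:.4f}")
```

Multiplying both the actual and predicted values by a constant multiplies MSE by the square of that constant, while R-squared stays the same. That is one way to see the interpretability point the abstract makes about unbounded rates. Predicting the mean of the actual values for every element gives an R-squared of exactly 0.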