夏普里值
变量(数学)
公理
回归
回归分析
价值(数学)
功能(生物学)
计量经济学
变量
计算机科学
数学优化
数学
数理经济学
统计
机器学习
博弈论
生物
数学分析
进化生物学
几何学
出处
期刊:Metals
[MDPI AG]
日期:2022-10-22
卷期号:12 (11): 1777-1777
被引量:6
摘要
Shapley value regression with machine learning models has recently emerged as an axiomatic approach to the development of diagnostic models. However, when large numbers of predictor variables have to be considered, these methods become infeasible, owing to the inhibitive computational cost. In this paper, an approximate Shapley value approach with random forests is compared with a full Shapley model, as well as other methods used in variable importance analysis. Three case studies are considered, namely one based on simulated data, a model predicting throughput in a calcium carbide furnace as a function of operating variables, and a case study related to energy consumption in a steel plant. The approximately Shapley approach achieved results very similar to those achieved with the full Shapley approach but at a fraction of the computational cost. Moreover, although the variable importance measures considered in this study consistently identified the most influential predictors in the case studies, they yielded different results when fewer influential predictors were considered, and none of the variable importance measures performed better than the other measures across all three case studies.
科研通智能强力驱动
Strongly Powered by AbleSci AI