可解释性
计算机科学
机器学习
人工智能
逻辑回归
超参数
广义线性模型
多项式logistic回归
计量经济学
数据挖掘
数学
作者
Kevin McDonnell,Finbarr Murphy,Barry Sheehan,Leandro Masello,German Castignani
标识
DOI:10.1016/j.eswa.2023.119543
摘要
Generalized Linear Models (GLMs) and XGBoost are widely used in insurance risk pricing and claims prediction, with GLMs dominant in the insurance industry. The increasing prevalence of connected car data usage in insurance requires highly accurate and interpretable models. Deep learning (DL) models have outperformed traditional Machine Learning (ML) models in multiple domains; despite this, they are underutilized in insurance risk pricing. This study introduces an alternative DL architecture, TabNet, suitable for insurance telematics datasets and claim prediction. This approach compares the TabNet DL model against XGBoost and Logistic Regression on the task of claim prediction on a synthetic telematics dataset. TabNet outperformed these models, providing highly interpretable results and capturing the sparsity of the claims data with high accuracy. However, TabNet requires considerable running time and effort in hyperparameter tuning to achieve these results. Despite these limitations, TabNet provides better pricing models for interpretable models in insurance when compared to XGBoost and Logistic Regression models.
科研通智能强力驱动
Strongly Powered by AbleSci AI