可解释性
计算机科学
Boosting(机器学习)
入侵检测系统
梯度升压
机器学习
人工智能
水准点(测量)
异常检测
数据挖掘
推论
算法
随机森林
大地测量学
地理
作者
Merve Nur Yılmaz,Batuhan Bardak
标识
DOI:10.1109/asyu56188.2022.9925451
摘要
Nowadays, protecting computer systems by preventing malicious network attacks is a vital topic. In recent years, ma-chine learning-based network intrusion detection systems (NIDS) started showing effective results. While the task of classifying cyber attacks in NIDS has been studied extensively in the literature, there is no comprehensive benchmark study with gradient boosting algorithms on recent open-source datasets. This paper aims to evaluate different gradient boosting-based algorithm performances including XGBoost, CatBoost, and LightGBM on different open-source NIDS datasets such as CIC-IDS2017, CSE-CIC-IDS2018, and INSDN. Furthermore, the SHapley Additive exPlanations (SHAP) is applied to increase the interpretability of the models and investigate the relationship between cyber attacks and the network features. Our experimental results demonstrate that the XGBoost model consistently outperforms other comparative models in F1 score for all datasets. At the same time, we compare the training/inference time of different gradient boosting algorithms which is an important constraint for real-time intrusion detection systems. Moreover, the different important features between different datasets can help data sci-entists for designing better artificial intelligence-based intrusion detection algorithms.
科研通智能强力驱动
Strongly Powered by AbleSci AI