恶意软件
计算机科学
机器学习
随机森林
特征选择
人工智能
决策树
tf–国际设计公司
互联网
勒索软件
特征工程
人工神经网络
分类
多层感知器
数据挖掘
深度学习
计算机安全
万维网
期限(时间)
物理
量子力学
作者
Muhammad Azeem,Danish Ali Khan,Sadaf Iftikhar,Shaikhan Bawazeer,Mohammed Alzahrani
出处
期刊:Heliyon
[Elsevier BV]
日期:2024-01-01
卷期号:10 (1): e23574-e23574
标识
DOI:10.1016/j.heliyon.2023.e23574
摘要
The Internet has become a vital source of knowledge and communication in recent times. Continuous technological advancements have changed the way businesses operate, and everyone today lives in the digital world of engineering. Because of the Internet of Things (IoT) and its applications, people's impressions of the information revolution have improved. Malware detection and categorization are becoming more of a problem in the cybersecurity world. As a result, strong security on the Internet could protect billions of internet users from harmful behavior. In malware detection and classification techniques, several types of deep learning models are used; however, they still have limitations. This study will explore malware detection and classification elements using modern machine learning (ML) approaches, including K-Nearest Neighbors (KNN), Extra Tree (ET), Random Forest (RF), Logistic Regression (LR), Decision Tree (DT), and neural network Multilayer Perceptron (nnMLP). The proposed study uses the publicly available dataset UNSWNB15. In our proposed work, we applied the feature encoding method to convert our dataset into purely numeric values. After that, we applied a feature selection method named Term Frequency-Inverse Document Frequency (TFIDF) based on entropy for the best feature selection. The dataset is then balanced and provided to the ML models for classification. The study concludes that Random Forest, out of all tested ML models, yielded the best accuracy of 97.68 %.
科研通智能强力驱动
Strongly Powered by AbleSci AI