Design And Analysis Of Insider Threat Detection And Prediction System Using Machine Learning Techniques

计算机科学 内部威胁 知情人 上传 互联网 特征选择 数据挖掘 集合(抽象数据类型) 机器学习 过程(计算) 人工智能 数据集 噪音(视频) 万维网 操作系统 图像(数学) 程序设计语言 法学 政治学
作者
Anupam Mittal,Urvashi Garg
标识
DOI:10.1109/icecct56650.2023.10179686
摘要

Data is critical for large as well as small organizations as customer trust depends upon the privacy of information maintained. The key tool that every organization uses for assessing resources is the Internet. The use of technology and the Internet comes with a cost. This cost is in the form of cyber-attacks that exist over the Internet. One of the hardest attacks to detect is the insider That occurs from within the organization. The organization must distinguish between the employers as well as the insiders. This paper purposed an optimization-based strategy for the detection of insider threats. Spider monkey optimization is applied to detect the sentiments present within the R4.2 cert data set. This data set has been generated by the university of Carnegie Mellon and is used to detect insider threats. The overall process of insider threat detection Starts with downloading the data set from github. The downloaded files have been compressed so to use them; they must be extracted. A large number of files are contained within the cert dataset. For the proposed work(SMLDA optimization), email and psychometric datasets are shortlisted. After extracting the dataset, pre-processing phase is applied. Within pre-processing, noise in terms of missing values is tackled. This is achieved by rejecting records containing null values within individual cells of the dataset. after pre-processing, dataset features are extracted using the content field of the dataset along with the natural language processing toolbox. Feature selection is performed using the Spyder monkey approach. Selection will be based on the contribution factor calculated with linear discriminant analysis. Using SMO, the highest contribution document built with LDA will be selected. In the end, the polarity of the document is calculated using the TextBlob library. The result of the SMO-driven sentiment analysis (anger, neutral, negative, positive, and sad) is compared with the plain LDA approach. SMO-driven sentiment analysis generates a classification accuracy of 99% and the LDA approach generates a classification accuracy of 90%. Furthermore, it was discovered that negative and sad sentiments most resulted in insider threats.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
ypj9777完成签到,获得积分10
1秒前
782221完成签到,获得积分10
1秒前
2秒前
Ava应助隐形丹珍采纳,获得10
4秒前
More应助Rain采纳,获得10
4秒前
小蘑菇应助李文敏采纳,获得10
7秒前
7秒前
科研通AI6.1应助skyler采纳,获得10
8秒前
空谷给跳跃的凡霜的求助进行了留言
9秒前
10秒前
充电宝应助kk99采纳,获得10
10秒前
10秒前
GM发布了新的文献求助10
10秒前
冰淇淋真凉完成签到,获得积分10
11秒前
NexusExplorer应助坦率夜山采纳,获得30
12秒前
12秒前
15秒前
17秒前
69qq完成签到,获得积分10
17秒前
17秒前
linyufeifei发布了新的文献求助10
18秒前
孙小子完成签到,获得积分10
18秒前
19秒前
眯眯眼的山柳完成签到,获得积分10
19秒前
文艺的续完成签到 ,获得积分10
21秒前
拉长的高山完成签到,获得积分10
22秒前
科研通AI6.1应助p454q采纳,获得10
22秒前
wang完成签到 ,获得积分10
22秒前
23秒前
kk99发布了新的文献求助10
23秒前
23秒前
24秒前
25秒前
26秒前
26秒前
Lotus完成签到,获得积分10
26秒前
张锐斌完成签到,获得积分10
27秒前
desmond发布了新的文献求助40
28秒前
linyufeifei完成签到,获得积分10
29秒前
29秒前
高分求助中
论现代体育科学研究的方法学特征 1000
Invited Discussant 63O and 64O 1000
Ideology and Meaning-Making under the Putin Regime 750
Prompt Engineering for Clinicians: Harnessing AI in Everyday Medical Practice 600
Safety Pharmacology 500
《KNN基无铅压电陶瓷电学性能优化与物理机理研究》 500
A Handbook of User Experience Research & Design in Libraries 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 计算机科学 化学工程 生物化学 物理 内科学 复合材料 催化作用 光电子学 物理化学 电极 细胞生物学 基因 遗传学
热门帖子
关注 科研通微信公众号,转发送积分 6918787
求助须知:如何正确求助?哪些是违规求助? 8609298
关于积分的说明 18265460
捐赠科研通 6333115
什么是DOI,文献DOI怎么找? 3069308
关于科研通互助平台的介绍 2098681
邀请新用户注册赠送积分活动 2046573