A Hybrid Deep Learning-Based Unsupervised Anomaly Detection in High Dimensional Data

自编码 计算机科学 维数之咒 异常检测 人工智能 人工神经网络 深度学习 机器学习 功能(生物学) 模式识别(心理学) 随机梯度下降算法 数据挖掘 进化生物学 生物
作者
Amgad Muneer,Shakirah Mohd Taib,Suliman Mohamed Fati,Abdullateef Oluwagbemiga Balogun,Izzatdin Abdul Aziz
出处
期刊:Computers, materials & continua 卷期号:70 (3): 5363-5381 被引量:12
标识
DOI:10.32604/cmc.2022.021113
摘要

Anomaly detection in high dimensional data is a critical research issue with serious implication in the real-world problems. Many issues in this field still unsolved, so several modern anomaly detection methods struggle to maintain adequate accuracy due to the highly descriptive nature of big data. Such a phenomenon is referred to as the “curse of dimensionality” that affects traditional techniques in terms of both accuracy and performance. Thus, this research proposed a hybrid model based on Deep Autoencoder Neural Network (DANN) with five layers to reduce the difference between the input and output. The proposed model was applied to a real-world gas turbine (GT) dataset that contains 87620 columns and 56 rows. During the experiment, two issues have been investigated and solved to enhance the results. The first is the dataset class imbalance, which solved using SMOTE technique. The second issue is the poor performance, which can be solved using one of the optimization algorithms. Several optimization algorithms have been investigated and tested, including stochastic gradient descent (SGD), RMSprop, Adam and Adamax. However, Adamax optimization algorithm showed the best results when employed to train the DANN model. The experimental results show that our proposed model can detect the anomalies by efficiently reducing the high dimensionality of dataset with accuracy of 99.40%, F1-score of 0.9649, Area Under the Curve (AUC) rate of 0.9649, and a minimal loss function during the hybrid model training.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
完美世界应助yihuifa采纳,获得10
刚刚
shengdong完成签到,获得积分10
刚刚
1秒前
2秒前
领导范儿应助wangwang采纳,获得10
2秒前
美好书瑶发布了新的文献求助10
4秒前
开昕完成签到,获得积分10
5秒前
XXXX完成签到,获得积分10
6秒前
二三发布了新的文献求助10
7秒前
抹颜完成签到 ,获得积分10
7秒前
ding应助美好书瑶采纳,获得10
12秒前
若白Carey完成签到 ,获得积分10
17秒前
17秒前
20秒前
22秒前
22秒前
chen发布了新的文献求助10
27秒前
城南徐师傅完成签到,获得积分10
28秒前
29秒前
朝暮里应助caiqinghua888888采纳,获得10
31秒前
美好书瑶完成签到,获得积分20
32秒前
chen完成签到,获得积分10
33秒前
MADKAI发布了新的文献求助10
33秒前
李健的粉丝团团长应助czb采纳,获得10
33秒前
34秒前
34秒前
MM完成签到,获得积分10
36秒前
MADKAI发布了新的文献求助10
37秒前
39秒前
40秒前
研友_VZG7GZ应助科研通管家采纳,获得10
41秒前
天天快乐应助科研通管家采纳,获得10
41秒前
小倩应助科研通管家采纳,获得20
41秒前
41秒前
41秒前
43秒前
科里斯皮尔应助小兵采纳,获得10
43秒前
小怨种发布了新的文献求助10
46秒前
47秒前
47秒前
高分求助中
Un calendrier babylonien des travaux, des signes et des mois: Séries iqqur îpuš 1036
IG Farbenindustrie AG and Imperial Chemical Industries Limited strategies for growth and survival 1925-1953 800
The Found Generation: Chinese Communists in Europe during the Twenties 700
Sustainable Land Management: Strategies to Cope with the Marginalisation of Agriculture 600
麦可思2024版就业蓝皮书 500
Handbook of Language Analysis in Psychology 500
Prochinois Et Maoïsmes En France (et Dans Les Espaces Francophones) 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2538005
求助须知:如何正确求助?哪些是违规求助? 2172880
关于积分的说明 5587232
捐赠科研通 1893302
什么是DOI,文献DOI怎么找? 943950
版权声明 565190
科研通“疑难数据库(出版商)”最低求助积分说明 502860