异常检测
计算机科学
可扩展性
卷积神经网络
人工智能
变压器
判别式
语义学(计算机科学)
可视化
深度学习
背景(考古学)
模式识别(心理学)
数据挖掘
机器学习
工程类
古生物学
电压
数据库
电气工程
生物
程序设计语言
作者
Haiming Yao,Wei Luo,Jianan Lou,Wenyong Yu,Xiaotian Zhang,Zhenfeng Qiang,Hui Shi
标识
DOI:10.1109/tim.2023.3343832
摘要
In recent years, the field of industrial visual anomaly detection has attracted significant attention in the context of advanced smart manufacturing systems. However, several limitations remain unresolved in existing approaches. While these methods can achieve satisfactory performance when training separate models for different categories, their scalability and performance suffer when faced with the challenge of simultaneous training for multiple categories. Reconstruction-based methods generally suffer from the identical mapping problem. To address these limitations, this study introduces the Partial Semantic Aggregation Vision Transformer (PSA-VT), a scalable framework for industrial visual anomaly detection that enables simultaneous multi-category anomaly detection using a single model. Our proposed PSA-VT framework adopts a hybrid design strategy. Firstly, a pre-trained convolutional neural network (CNN) is employed to extract multi-scale discriminative local representation. Subsequently, the partial semantic aggregation Vision transformer(PSA-VT) is introduced to perform representation reconstruction through long-range global semantic aggregation. Finally, the anomalous properties can be estimated by evaluating the reconstruction error of the representations. We conducted extensive experiments using the Mvtec AD industrial anomaly detection dataset, as well as the semantic anomaly detection datasets. The experimental results demonstrate that our method achieves state-of-the-art performance by capturing high-level semantics. Notably, PSA-VT surpasses other methods for the 1-model-15-category anomaly detection tasks on the Mvtec AD dataset. Furthermore, we applied incremental learning techniques to enable the rapid deployment of PSA-VT in a real industrial scenario.
科研通智能强力驱动
Strongly Powered by AbleSci AI