计算机科学
人工智能
异常检测
建筑
变压器
编码器
计算机视觉
模式识别(心理学)
图像(数学)
绘图
异常(物理)
计算机图形学(图像)
工程类
地理
物理
电气工程
操作系统
电压
考古
凝聚态物理
作者
Mengting Zhang,Xiuxia Tian
标识
DOI:10.1016/j.vrih.2022.07.006
摘要
Image anomaly detection is a popular task in computer graphics, which is widely used in industrial fields. Previous works that address this problem often train CNN-based (e.g. Auto-Encoder, GANs) models to reconstruct covered parts of input images and calculate the difference between the input and the reconstructed image. However, convolutional operations are good at extracting local features making it difficult to identify larger image anomalies. To this end, we propose a transformer architecture based on mutual attention for image anomaly separation. This architecture can capture long-term dependencies and fuse local features with global features to facilitate better image anomaly detection. Our method was extensively evaluated on several benchmarks, and experimental results showed that it improved detection capability by 3.1% and localization capability by 1.0% compared with state-of-the-art reconstruction-based methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI