计算机科学
人工智能
特征提取
模式识别(心理学)
变压器
特征学习
特征(语言学)
目标检测
工程类
电压
语言学
电气工程
哲学
作者
Hantao Zhou,Rui Yang,Runze Hu,Chang Shu,X. Tang,Xiu Li
标识
DOI:10.1109/tim.2023.3307753
摘要
Deep learning-based surface defect detectors play a crucial role in ensuring product quality during inspection processes. However, accurately and efficiently detecting defects remains challenging due to specific characteristics inherent in defective images, involving a high degree of foreground-background similarity, scale variation, and shape variation. To address this challenge, we propose an efficient Transformer-based detection network, ETDNet, consisting of three novel designs to achieve superior performance. Firstly, ETDNet takes a lightweight Vision Transformer to extract representative global features. This approach ensures an accurate feature characterization of defects even with similar backgrounds. Secondly, a channel-modulated feature pyramid network (CM-FPN) is devised to fuse multi-level features and maintain critical information from corresponding levels. Lastly, a novel task-oriented decoupled (TOD) head is introduced to tackle inconsistent representation between classification and regression tasks. The TOD head employs a local feature representation module to learn object-aware local features and introduces a global feature representation module, based on the attention mechanism, to learn content-aware global features. By integrating these two modules into the head, ETDNet can effectively classify and perceive defects with varying shapes and scales. Extensive experiments on various defect detection datasets demonstrate the effectiveness of the proposed ETDNet. For instance, it achieves AP 46.7% (v.s. 45.9%), and AP 50 80.2% (v.s. 79.1%) with 49 FPS on NeU-DET. Code is available at https://github.com/zht8506/ETDNet.
科研通智能强力驱动
Strongly Powered by AbleSci AI