计算机科学
变压器
嵌入
卷积(计算机科学)
卷积神经网络
深度学习
人工智能
模式识别(心理学)
工程类
电压
电气工程
人工神经网络
作者
Cheng Wang,Haibing Liu,Xiaoya An,Zhiqun Gong,Fei Deng
标识
DOI:10.1016/j.dsp.2023.104297
摘要
By leveraging deep learning methods, pavement crack detection can be more automatic, efficient, and accurate than manual inspection. To solve the problem of limited receptive field in pure CNN-based crack detection networks, we proposed an end-to-end detection network based on Swin-Transformer, called SwinCrack. SwinCrack can produce more accurate and continuous descriptions of pavement cracks by modeling long-range interactions and adaptive spatial aggregation compared to CNN-based detection models. Furthermore, to delineate crisp and accurate crack boundaries, we introduced convolution operations to Swin-Transformer for more local and detailed crack information. Convolutional Patch Embedding Layer (CPEL), Convolutional Swin-Transformer Block (CSTB), and Depth-convolution Forward Network (DFN) are proposed and embedded into SwinCrack to capture more spatial contexts. Also, Convolutional Attention Gated Skip Connection (CAGSC) is designed to suppress background interference in low-level features. Furthermore, five evaluation experiments on SwinCrack and an ablation study on the four proposed modules are performed. The attention maps of the SwinCrack are visualized to give a better insight into the contribution of each convolutional module embedded. Evaluation results show that SwinCrack gains OIS values of 0.781 to 0.849 and a maximum 4.4% improvement on OIS among the six public crack datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI