人工智能
棱锥(几何)
计算机科学
深度学习
分割
计算机视觉
模式识别(心理学)
联营
数学
几何学
作者
Enhui Gu,Gang Xiao,Faming Lian,Tongyao Mu,Jie Hong,Jun Liu
标识
DOI:10.1109/tim.2023.3272052
摘要
Jet fuel leaks not only waste resources and increase costs but also pose a risk of emergency landings and aviation accidents. With the blossom and implementation of deep learning, crack segmentation techniques have been rapidly developed in many fields. However, it is struggle to get accurate and complete crack segmentation results because of images' complex background environment. To address this issue, we collected and labeled a dataset of 2824 crack images from the surface of aircraft fuel tank, named CAFT2800. And this article presents an atrous spatial pyramid fusion and hybrid attention network based on deep learning to deal with the complicated environment. The backbone of network uses a hierarchical structure Swin Transformer to extract features. In the neck of the network, an atrous spatial pyramid fusion module is proposed to further capture contextual information in multiple scale. Unlike the ASPP module, which aggregates image-level information with pooling, we abandon this operation according to the characteristics of crack. And the neck structure is designed for fusing high-level semantic information to all scales. At the gate between the neck and the decoder head, a SK (selective kernel) block is embedded into the network to recalibrates channel-wise feature responses. Due to the morphological characteristics of crack, we propose an evaluation index, Thinning F1 score (TFscore), which is more meaningful compared to the commonly used F1 score. Sufficient control experiments were conducted on the CAFT2800 dataset and two complicated environment benchmarks (DeepCrack and GAPs) to test the effectiveness of the network, and our method achieved superior performance. Source code and the CAFT2800 are available at https://github.com/Gu-EH/CAFT2800.
科研通智能强力驱动
Strongly Powered by AbleSci AI