计算机科学
特征提取
人工智能
RGB颜色模型
特征(语言学)
语义学(计算机科学)
模式识别(心理学)
一致性(知识库)
图形
邻接表
卷积神经网络
深度学习
计算机视觉
算法
哲学
理论计算机科学
程序设计语言
语言学
作者
Jinxin Yang,Wujie Zhou,Ruiming Wu,Meixin Fang
标识
DOI:10.1109/lsp.2023.3299218
摘要
Rail surface defect detection for traffic safety has received considerable attention. With the development of deep learning, numerous methods for combining RGB and depth information have been proposed. However, these methods directly fuse raw features extracted from the backbone, which can lead to ineffective use of the complementary information of the two modalities. In this study, we developed a contour and semantic feature alignment fusion network (CSANet) with bidirectional feature alignment to explore the internal consistency of cross-modal features from both contour and semantic perspectives. First, an adjacency contour feature extraction module was designed to capture high-quality contour information from adjacent low-level features. Second, an attention-aware graph convolution embedded semantic feature extraction module was designed to explore long-range dependencies and extract semantic information. Third, a bidirectional alignment mechanism was designed to explore the internal consistency of contours and semantics between bimodal features. Experimental results on the industrial RGB-D dataset (NEU RSDDS-AUG) revealed that the proposed CSANet outperformed 12 state-of-the-art algorithms in four evaluation metrics.
科研通智能强力驱动
Strongly Powered by AbleSci AI