计算机科学
判别式
人工智能
特征(语言学)
分割
背景(考古学)
模式识别(心理学)
像素
图像分割
相互信息
语义学(计算机科学)
图像(数学)
计算机视觉
古生物学
哲学
语言学
生物
程序设计语言
作者
Ke An,Yupei Wang,Liang Chen
标识
DOI:10.1109/tgrs.2024.3352582
摘要
Recently, semantic segmentation of remote sensing images has witnessed rapid advancement with the adoption of deep neural networks. Contextual cues, referring to the long-range correlation between pixels, are crucial for achieving accurate segmentation results, particularly for objects with less discriminative characteristics in these images. Currently, most studies are centered on incorporating contextual cues by aggregating context information at the dataset-level or image-level. However, current research often treats contextual cue modeling at the dataset-level and image-level as independent procedures, neglecting the intrinsic correlation between these two feature levels. Consequently, the obtained contextual cues are sub-optimal. This issue is particularly critical in the semantic segmentation of remote sensing images. To address this, we propose to encourage mutual interaction between dataset-level and image-level contextual cues. Firstly, we propose an interactive dataset-image context aggregation scheme to obtain complementary and consistent multi-level contextual cues. Additionally, we introduce a parallel feature interaction network that progressively extracts and fuses features across multiple layers, enabling effective integration of multi-level contexts. Furthermore, we introduce an enhanced shifted window-based cross-attention mechanism to augment model efficiency. Extensive experimental results on the widely used Vaihingen, GaoFen-2 and iSAID datasets effectively demonstrate the superiority of our proposed method over state-of-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI