Learning Frequency-Domain Fusion for Multimodal Remote Sensing Semantic Segmentation

遥感计算机科学特征（语言学）模态（人机交互）分割人工智能语义映射钥匙（锁）图像融合图像分割计算机视觉传感器融合融合遥感应用方案（数学）特征学习可扩展性模式解耦（概率）编码（集合论）代表（政治）语义学（计算机科学）特征向量特征提取语义鸿沟编码（内存）模式识别（心理学）空间分析

作者

Guangsheng Chen,Fangyu Sun,Weipeng Jing,Weitao Zou,Donglin Di,Yang Song,Lei Fan

出处

期刊：IEEE Transactions on Geoscience and Remote Sensing [Institute of Electrical and Electronics Engineers]
日期：2025-01-01 卷期号：63: 1-16 被引量：2

标识

DOI：10.1109/tgrs.2025.3622749

摘要

Multimodal remote sensing data substantially enhance semantic segmentation accuracy by providing complementary information across sensing modalities. However, fully exploiting and effectively fusing features from different modalities to capture comprehensive semantic representations remains a challenge. Most existing methods restrict interactions to the spatial domain, making their representations vulnerable to heterogeneity arising from distinct imaging mechanisms. To address these issues, we propose FDMF-Net, a Frequency-domain Decoupled Modality Fusion network. Our approach comprises three key modules: the Amplitude Spectrum Decoupling module (ASD), the Modality Enhancement module (ME), and the Low-Frequency-Guided Feature Fusion module (LFGF), dedicated to extracting, enhancing, and fusing modality-invariant and specific representations, respectively. The ASD module performs frequency-domain decomposition to separate modality-invariant and modality-specific features, promoting more effective cross-modal complementarity. The ME module introduces a mutual information-based feature enhancement scheme to obtain more robust modality-invariant and modality-specific representations, thereby improving feature discriminability. The LFGF module, based on an attention mechanism, fuses shared and specific representations to generate feature maps with richer semantic information. Extensive evaluations on multiple standard multimodal remote sensing datasets demonstrate that FDMF-Net achieves state-of-the-art accuracy across several benchmarks. The code is available at https://github.com/fy-sun/FDMF-Net.

求助该文献

最长约 10秒，即可获得该文献文件

Learning Frequency-Domain Fusion for Multimodal Remote Sensing Semantic Segmentation

今日热心研友