遥感
计算机科学
特征(语言学)
模态(人机交互)
分割
人工智能
语义映射
钥匙(锁)
图像融合
图像分割
计算机视觉
传感器融合
融合
遥感应用
方案(数学)
特征学习
可扩展性
模式
解耦(概率)
编码(集合论)
代表(政治)
语义学(计算机科学)
特征向量
特征提取
语义鸿沟
编码(内存)
模式识别(心理学)
空间分析
作者
Guangsheng Chen,Fangyu Sun,Weipeng Jing,Weitao Zou,Donglin Di,Yang Song,Lei Fan
标识
DOI:10.1109/tgrs.2025.3622749
摘要
Multimodal remote sensing data substantially enhance semantic segmentation accuracy by providing complementary information across sensing modalities. However, fully exploiting and effectively fusing features from different modalities to capture comprehensive semantic representations remains a challenge. Most existing methods restrict interactions to the spatial domain, making their representations vulnerable to heterogeneity arising from distinct imaging mechanisms. To address these issues, we propose FDMF-Net, a Frequency-domain Decoupled Modality Fusion network. Our approach comprises three key modules: the Amplitude Spectrum Decoupling module (ASD), the Modality Enhancement module (ME), and the Low-Frequency-Guided Feature Fusion module (LFGF), dedicated to extracting, enhancing, and fusing modality-invariant and specific representations, respectively. The ASD module performs frequency-domain decomposition to separate modality-invariant and modality-specific features, promoting more effective cross-modal complementarity. The ME module introduces a mutual information-based feature enhancement scheme to obtain more robust modality-invariant and modality-specific representations, thereby improving feature discriminability. The LFGF module, based on an attention mechanism, fuses shared and specific representations to generate feature maps with richer semantic information. Extensive evaluations on multiple standard multimodal remote sensing datasets demonstrate that FDMF-Net achieves state-of-the-art accuracy across several benchmarks. The code is available at https://github.com/fy-sun/FDMF-Net.
科研通智能强力驱动
Strongly Powered by AbleSci AI