FTransDeepLab: Multimodal Fusion Transformer-Based DeepLabv3+ for Remote Sensing Semantic Segmentation

计算机科学 分割 融合 人工智能 计算机视觉 遥感 模式识别(心理学) 地质学 哲学 语言学
作者
Haixia Feng,Qingwu Hu,Pengcheng Zhao,Shunli Wang,Mingyao Ai,Daoyuan Zheng,Tiancheng Liu
出处
期刊:IEEE Transactions on Geoscience and Remote Sensing [Institute of Electrical and Electronics Engineers]
卷期号:63: 1-18 被引量:19
标识
DOI:10.1109/tgrs.2025.3553478
摘要

High-resolution remote sensing images contain rich color and texture information, but due to the inherent limitations of 2-D data, achieving high-quality semantic segmentation remains a challenge. Multimodal data fusion technology has emerged as an effective approach to overcome this issue. To accurately capture the semantic information in remote sensing images, this study designs a multimodal fusion Transformer-based DeepLabv3+ model for remote sensing semantic segmentation, named FTransDeepLab. Specifically, the network learns features from two modalities and is inspired by the DeepLab architecture. We extended the encoder by stacking the multiscale Segformer, encoding the input images into highly representative spatial features. Additionally, we introduced the multimodal feature rectification (MFR) module and the multimodal feature fusion (MFF) module. The MFR, composed of a channel attention module and a spatial attention module, enhances the model’s ability to capture essential features and improves performance by focusing on both global and local contexts. The MFF module utilizes a cross-attention mechanism to optimize the feature fusion process, which enhances representation learning by facilitating the interaction between diverse information and integrates features from different modalities. Finally, in the decoding path, the extracted high-level features are concatenated with low-level features to optimize the feature representation and upsampled to restore the size of input image. Extensive results on two datasets, the International Society for Photogrammetry and Remote Sensing (ISPRS) Vaihingen and Potsdam, have confirmed that the proposed FTransDeepLab can achieve superior performance compared to the state-of-the-art segmentation methods.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
山野村夫完成签到,获得积分10
刚刚
QTyc2026完成签到,获得积分10
1秒前
xnz发布了新的文献求助10
1秒前
quanhua完成签到,获得积分10
1秒前
loomcool完成签到,获得积分10
2秒前
2秒前
胡胡嘉嘉磊磊完成签到,获得积分10
2秒前
行走De太阳花完成签到,获得积分10
2秒前
wodeqiche2007发布了新的文献求助10
2秒前
崔雨禾完成签到 ,获得积分10
3秒前
晴空完成签到,获得积分10
3秒前
Lvy完成签到,获得积分0
3秒前
七十三度完成签到,获得积分10
3秒前
周灏烜完成签到,获得积分10
4秒前
取名叫做利完成签到 ,获得积分10
4秒前
小宋完成签到,获得积分10
5秒前
xu完成签到,获得积分10
6秒前
csy完成签到,获得积分10
6秒前
6秒前
学海星辰应助不安青牛采纳,获得50
6秒前
石敢当完成签到,获得积分10
7秒前
喷火娃应助Mr.Ren采纳,获得10
7秒前
Running完成签到 ,获得积分10
8秒前
可可西里完成签到 ,获得积分10
8秒前
cjq发布了新的文献求助30
8秒前
幽默的煎饼完成签到,获得积分10
8秒前
小尘埃完成签到,获得积分0
9秒前
开心的抽屉完成签到,获得积分10
9秒前
贺呵呵完成签到,获得积分10
9秒前
大力的觅松完成签到,获得积分10
9秒前
MeiyanZou完成签到,获得积分10
9秒前
Air云完成签到,获得积分0
10秒前
平淡的翅膀完成签到 ,获得积分10
10秒前
zhazha完成签到 ,获得积分10
10秒前
10秒前
何y完成签到 ,获得积分10
10秒前
10秒前
wgl200212完成签到,获得积分10
11秒前
wh完成签到,获得积分10
11秒前
yuhaha完成签到,获得积分10
11秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
The Organometallic Chemistry of the Transition Metals 800
Chemistry and Physics of Carbon Volume 18 800
The Organometallic Chemistry of the Transition Metals 800
Leading Academic-Practice Partnerships in Nursing and Healthcare: A Paradigm for Change 800
The formation of Australian attitudes towards China, 1918-1941 640
Signals, Systems, and Signal Processing 610
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6436739
求助须知:如何正确求助?哪些是违规求助? 8251226
关于积分的说明 17552346
捐赠科研通 5495144
什么是DOI,文献DOI怎么找? 2898214
邀请新用户注册赠送积分活动 1875008
关于科研通互助平台的介绍 1716197