Transformer and CNN Hybrid Deep Neural Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Imagery

计算机科学 人工智能 分割 编码器 卷积神经网络 棱锥(几何) 模式识别(心理学) 图像分割 特征(语言学) 深度学习 特征提取 计算机视觉 哲学 物理 光学 操作系统 语言学
作者
Cheng Zhang,Wan Jiang,Yuan Zhang,Wei Wang,Qing Zhao,Chen Jie Wang
出处
期刊:IEEE Transactions on Geoscience and Remote Sensing [Institute of Electrical and Electronics Engineers]
卷期号:60: 1-20 被引量:66
标识
DOI:10.1109/tgrs.2022.3144894
摘要

This article presents a transformer and convolutional neural network (CNN) hybrid deep neural network for semantic segmentation of very high resolution (VHR) remote sensing imagery. The model follows an encoder–decoder structure. The encoder module uses a new universal backbone Swin transformer to extract features to achieve better long-range spatial dependencies modeling. The decoder module draws on some effective blocks and successful strategies of CNN-based models in remote sensing image segmentation. In the middle of the framework, an atrous spatial pyramid pooling block based on depthwise separable convolution (SASPP) is applied to obtain a multiscale context. A U-shaped decoder is used to gradually restore the size of the feature maps. Three skip connections are built between the encoder and decoder feature maps of the same size to maintain the transmission of local details and enhance the communication of multiscale features. A squeeze-and-excitation (SE) channel attention block is added before segmentation for feature augmentation. An auxiliary boundary detection branch is combined to provide edge constraints for semantic segmentation. Extensive ablation experiments were conducted on the International Society for Photogrammetry and Remote Sensing (ISPRS) Vaihingen and Potsdam benchmarks to test the effectiveness of multiple components of the network. At the same time, the proposed method is compared with the current state-of-the-art methods on the two benchmarks. The proposed hybrid network achieved the second highest overall accuracy (OA) on both the Potsdam and Vaihingen benchmarks (code and models are available at https://github.com/zq7734509/mmsegmentation- multilayer).
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
zzzzzdz完成签到,获得积分10
1秒前
YANGLan完成签到,获得积分10
3秒前
华仔应助幸福诗槐采纳,获得10
3秒前
4秒前
叮当猫的悠闲生活完成签到 ,获得积分10
4秒前
7秒前
8秒前
快乐的龙猫完成签到,获得积分10
9秒前
QUQUE发布了新的文献求助10
9秒前
sxy完成签到,获得积分10
9秒前
12秒前
汉堡包应助小巧的芷珍采纳,获得10
12秒前
ZQP发布了新的文献求助10
13秒前
15秒前
FashionBoy应助knight0524采纳,获得10
16秒前
19秒前
ZQP完成签到,获得积分10
19秒前
xun发布了新的文献求助10
24秒前
24秒前
sophia完成签到,获得积分10
25秒前
小贾完成签到,获得积分10
25秒前
尊敬幼丝完成签到,获得积分10
25秒前
donfern完成签到,获得积分10
25秒前
26秒前
32秒前
hw完成签到 ,获得积分10
33秒前
35秒前
舒适太阳发布了新的文献求助10
37秒前
ddbb发布了新的文献求助10
37秒前
李健的小迷弟应助葛zh采纳,获得10
39秒前
40秒前
40秒前
小时候喜欢迪迦完成签到,获得积分10
40秒前
Siwen发布了新的文献求助10
42秒前
43秒前
汉堡包应助令狐擎宇采纳,获得10
43秒前
黑娃完成签到,获得积分10
46秒前
CipherSage应助ljn采纳,获得10
47秒前
52秒前
年轻的笙完成签到,获得积分10
54秒前
高分求助中
Manual of Clinical Microbiology, 4 Volume Set (ASM Books) 13th Edition 1000
Chinese-English Translation Lexicon Version 3.0 500
Electronic Structure Calculations and Structure-Property Relationships on Aromatic Nitro Compounds 500
マンネンタケ科植物由来メロテルペノイド類の網羅的全合成/Collective Synthesis of Meroterpenoids Derived from Ganoderma Family 500
薩提亞模式團體方案對青年情侶輔導效果之研究 400
[Lambert-Eaton syndrome without calcium channel autoantibodies] 400
Statistical Procedures for the Medical Device Industry 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2380023
求助须知:如何正确求助?哪些是违规求助? 2087232
关于积分的说明 5240624
捐赠科研通 1814332
什么是DOI,文献DOI怎么找? 905220
版权声明 558734
科研通“疑难数据库(出版商)”最低求助积分说明 483242