TransUNetCD: A Hybrid Transformer Network for Change Detection in Optical Remote-Sensing Images

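Artificial intelligence; Pattern recognition (psychology); Feature (linguistics); Computer science; Weighting; Feature extraction; Transformer; Convolutional neural network; Decoding methods; Pixel; Algorithm; Voltage; Quantum mechanics; Medicine; Physics; Radiology; Philosophy; Linguistics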
Authors
Qingyang Li, Rugang Zhong, Xin Du, Yu Du
Source
Journal: IEEE Transactions on Geoscience and Remote Sensing [Institute of Electrical and Electronics Engineers]
Volume: 60, pages 1-19. Cited by: 49
Identifier
DOI:10.1109/tgrs.2022.3169479
Abstract

In the change detection (CD) task, the UNet architecture has achieved superior results. However, due to the inherent limitations of convolution operations, UNet is inadequate at learning global context and long-range spatial relations. Transformers can capture long-range feature dependencies, but their lack of low-level detail may limit their localization capability. Therefore, this article proposes an end-to-end encoding–decoding hybrid transformer model for CD, TransUNetCD, which combines the advantages of transformers and UNet. The model encodes tokenized image patches from the convolutional neural network (CNN) feature map to extract rich global context information. The decoder upsamples the encoded features, connects them with higher-resolution multiscale features through skip connections to learn local–global semantic features, and restores the full spatial resolution of the feature map to achieve precise localization. The proposed model not only solves the problem of redundant information being generated when low-level features are extracted under the UNet framework, but also solves the problem that the relationships between feature layers cannot be fully modeled and the optimal feature difference representation cannot be obtained. On this basis, we introduce a difference enhancement module to generate a difference feature map containing rich change information. By weighting each pixel and selectively aggregating features, the effectiveness of the network and the accuracy of extracting change features are improved. Results on multiple datasets demonstrate that, compared with state-of-the-art methods, TransUNetCD further reduces false alarms and missed alarms and delineates the edges of changed areas more accurately. The model scores higher than the other baseline models on every metric and has a robust generalization ability.
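
Two ideas from the abstract, tokenizing a CNN feature map into patches and weighting a bi-temporal difference map pixel by pixel, can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names `tokenize_patches` and `weighted_difference`, the nested-list `[channel][row][col]` layout, and the sigmoid gate on the channel-mean difference are all assumptions made for illustration. The abstract does not specify how the difference enhancement module computes its weights.

```python
import math
from typing import List

# Assumed layout for this sketch: feature_map[channel][row][col]
FeatureMap = List[List[List[float]]]


def tokenize_patches(fmap: FeatureMap, patch: int) -> List[List[float]]:
    """Flatten non-overlapping patch x patch windows into token vectors.

    Each token concatenates the values of every channel inside one window,
    which is the usual way a feature map is turned into a token sequence.
    """
    channels = len(fmap)
    height, width = len(fmap[0]), len(fmap[0][0])
    if height % patch or width % patch:
        raise ValueError("height and width must be divisible by the patch size")
    tokens = []
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            tokens.append([
                fmap[c][top + dy][left + dx]
                for c in range(channels)
                for dy in range(patch)
                for dx in range(patch)
            ])
    return tokens


def weighted_difference(feat_t1: FeatureMap, feat_t2: FeatureMap) -> FeatureMap:
    """Absolute difference of two feature maps, rescaled by a per-pixel weight.

    Hypothetical gate: the weight is a sigmoid of the channel-mean absolute
    difference, so pixels that differ strongly are emphasized. A learned
    module would replace this fixed rule.
    """
    channels = len(feat_t1)
    height, width = len(feat_t1[0]), len(feat_t1[0][0])
    out = [[[0.0] * width for _ in range(height)] for _ in range(channels)]
    for y in range(height):
        for x in range(width):
            diffs = [abs(feat_t1[c][y][x] - feat_t2[c][y][x]) for c in range(channels)]
            weight = 1.0 / (1.0 + math.exp(-sum(diffs) / channels))
            for c in range(channels):
                out[c][y][x] = weight * diffs[c]
    return out
```

In a real network the token vectors would be linearly projected and passed to transformer layers, and the per-pixel weights would be learned rather than fixed. The sketch only shows the data flow the abstract describes.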