Changes to Captions: An Attentive Network for Remote Sensing Change Captioning

隐藏字幕 计算机科学 变更检测 人工智能 文字嵌入 块(置换群论) 特征提取 遥感 嵌入 图像(数学) 模式识别(心理学) 地理 几何学 数学
作者
Shizhen Chang,Pedram Ghamisi
出处
期刊:IEEE transactions on image processing [Institute of Electrical and Electronics Engineers]
卷期号:32: 6047-6060 被引量:64
标识
DOI:10.1109/tip.2023.3328224
摘要

In recent years, advanced research has focused on the direct learning and analysis of remote-sensing images using natural language processing (NLP) techniques. The ability to accurately describe changes occurring in multi-temporal remote sensing images is becoming increasingly important for geospatial understanding and land planning. Unlike natural image change captioning tasks, remote sensing change captioning aims to capture the most significant changes, irrespective of various influential factors such as illumination, seasonal effects, and complex land covers. In this study, we highlight the significance of accurately describing changes in remote sensing images and present a comparison of the change captioning task for natural and synthetic images and remote sensing images. To address the challenge of generating accurate captions, we propose an attentive changes-to-captions network, called Chg2Cap for short, for bi-temporal remote sensing images. The network comprises three main components: 1) a Siamese CNN-based feature extractor to collect high-level representations for each image pair; 2) an attentive encoder that includes a hierarchical self-attention block to locate change-related features and a residual block to generate the image embedding; and 3) a transformer-based caption generator to decode the relationship between the image embedding and the word embedding into a description. The proposed Chg2Cap network is evaluated on two representative remote sensing datasets, and a comprehensive experimental analysis is provided. The code and pre-trained models will be available online at https://github.com/ShizhenChang/Chg2Cap.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
1秒前
2秒前
灵巧的岂愈完成签到,获得积分10
2秒前
morning发布了新的文献求助10
3秒前
赘婿应助小乐子采纳,获得10
4秒前
7秒前
寻13发布了新的文献求助10
8秒前
8秒前
13秒前
14秒前
14秒前
15秒前
17秒前
dzxlgw发布了新的文献求助10
17秒前
18秒前
19秒前
Qi关闭了Qi文献求助
19秒前
20秒前
wanci应助boyue采纳,获得10
20秒前
20秒前
木月祈完成签到,获得积分10
20秒前
穿肠酒发布了新的文献求助10
21秒前
王若菡完成签到,获得积分10
21秒前
Liyo发布了新的文献求助10
22秒前
24秒前
24秒前
shijiaoshou发布了新的文献求助10
25秒前
25秒前
木月祈发布了新的文献求助10
26秒前
Ecokarster完成签到,获得积分10
26秒前
26秒前
26秒前
27秒前
佼佼者发布了新的文献求助10
28秒前
qqqq发布了新的文献求助10
30秒前
Queenie完成签到,获得积分10
30秒前
ArCaaaat发布了新的文献求助10
32秒前
33秒前
吃猫的鱼发布了新的文献求助10
33秒前
34秒前
高分求助中
GL 2 A method for assessing the in-place cleanability of food processing equipment, Fourth Edition, December 2023 3000
Annie Ernaux: De la perte au corps glorieux 600
Writing Systems 500
类器官构建与应用:从基础到前沿 500
Optical Coating Design with the Essential Macleod 400
A revision of Limenitis helmanni and its related species (Nymphalidae) from Central and South China 400
Moore's Clinically Oriented Anatomy 10th Edition 400
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6798505
求助须知:如何正确求助?哪些是违规求助? 8517403
关于积分的说明 18139355
捐赠科研通 6113316
什么是DOI,文献DOI怎么找? 3025092
邀请新用户注册赠送积分活动 2001698
关于科研通互助平台的介绍 1993446