Diffusion-RSCC: Diffusion Probabilistic Model for Change Captioning in Remote Sensing Images

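Diffusion; Probabilistic logic; Closed captioning; Computer science; Remote sensing; Artificial intelligence; Geology; Image (mathematics); Physics; Thermodynamics
Authors
Xiaofei Yu, Yitong Li, Jie Ma, Chang Li, Hanlin Wu
Source
Journal: IEEE Transactions on Geoscience and Remote Sensing [Institute of Electrical and Electronics Engineers]
Volume/Issue: 63: 1-13  Citations: 17
Identifier
DOI: 10.1109/tgrs.2025.3554360
Abstract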

Remote sensing image change captioning (RSICC) aims to generate human-like language that describes the semantic changes between bitemporal remote sensing image (RSI) pairs. It provides valuable insight into environmental dynamics and land management. Unlike conventional change captioning (CC) tasks, RSICC involves not only retrieving relevant information across modalities and generating fluent captions, but also mitigating the impact of pixel-level differences on terrain change localization, because pixel-level discrepancies that accumulate over a long time span decrease caption accuracy. To address these problems, we propose a probabilistic diffusion-based model that leverages the generative capability of diffusion models to produce flexible captions. In the training phase, we construct a conditional denoiser to efficiently map the real caption distribution to a standard Gaussian distribution. This denoiser incorporates cross-mode fusion (CMF) and stacking self-attention (SSA) modules to enhance cross-modal alignment and reduce pixel interference, thereby improving caption accuracy. In the testing phase, the conditional denoiser provides a new strategy for mean value estimation and helps to generate captions step by step. Extensive experiments on the LEVIR-CC and DUBAI-CC datasets demonstrate the effectiveness of Diffusion-RSCC and of each of its individual components. The quantitative results show superior performance over existing methods on both traditional and newly introduced metrics. The code is available at: https://github.com/Fay-Y/Diffusion-RSCC.
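
The abstract describes two diffusion stages: a forward process that carries caption data toward a standard Gaussian, and a reverse process that estimates a mean at every step and generates the caption gradually. The sketch below illustrates that general idea with the textbook DDPM equations; it is not the authors' implementation. It assumes a caption is represented as a continuous embedding vector, uses a linear noise schedule, and replaces the image-conditioned denoiser (with its CMF and SSA modules) by a plain callable. All function names here are hypothetical.

```python
import math
import random

def linear_beta_schedule(num_steps, beta_start=1e-4, beta_end=0.02):
    """Linearly spaced noise variances beta_1..beta_T (an assumed schedule)."""
    if num_steps == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (num_steps - 1)
    return [beta_start + i * step for i in range(num_steps)]

def cumulative_alphas(betas):
    """alpha_bar_t = product over s <= t of (1 - beta_s)."""
    out, prod = [], 1.0
    for b in betas:
        prod *= 1.0 - b
        out.append(prod)
    return out

def forward_noise(x0, t, alpha_bars, rng):
    """Sample x_t ~ N(sqrt(alpha_bar_t) * x0, (1 - alpha_bar_t) * I)."""
    ab = alpha_bars[t]
    eps = [rng.gauss(0.0, 1.0) for _ in x0]
    xt = [math.sqrt(ab) * x + math.sqrt(1.0 - ab) * e for x, e in zip(x0, eps)]
    return xt, eps

def posterior_mean(xt, eps_hat, t, betas, alpha_bars):
    """Mean of the reverse step x_t -> x_{t-1}, given a noise estimate eps_hat."""
    alpha_t = 1.0 - betas[t]
    coef = betas[t] / math.sqrt(1.0 - alpha_bars[t])
    return [(x - coef * e) / math.sqrt(alpha_t) for x, e in zip(xt, eps_hat)]

def reverse_sample(denoiser, dim, betas, alpha_bars, rng):
    """Start from Gaussian noise and denoise step by step down to t = 0."""
    x = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    for t in reversed(range(len(betas))):
        mean = posterior_mean(x, denoiser(x, t), t, betas, alpha_bars)
        sigma = math.sqrt(betas[t]) if t > 0 else 0.0  # no noise on the final step
        x = [m + sigma * rng.gauss(0.0, 1.0) for m in mean]
    return x

# Usage: an "oracle" denoiser that already knows the target embedding.
# A trained model would instead predict the noise from x_t, t, and image features.
rng = random.Random(0)
betas = linear_beta_schedule(1000)
alpha_bars = cumulative_alphas(betas)
target = [0.5, -1.0, 2.0]  # stand-in for a caption embedding

def oracle_denoiser(xt, t):
    ab = alpha_bars[t]
    return [(x - math.sqrt(ab) * x0) / math.sqrt(1.0 - ab) for x, x0 in zip(xt, target)]

sample = reverse_sample(oracle_denoiser, len(target), betas, alpha_bars, rng)
```

With the oracle noise estimate, the final reverse step lands on the target vector up to floating-point error, which makes the role of the mean estimate easy to check. In a real captioning system the recovered embeddings would still need to be mapped back to discrete tokens, a step this sketch leaves out.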