RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation Model

基础(证据) 变更检测 萃取(化学) 遥感 计算机科学 图像(数学) 计算机视觉 人工智能 地质学 地理 化学 考古 色谱法
作者
Mingze Wang,Keyan Chen,Lili Su,Cilin Yan,Sheng Xu,Haotian Zhang,Pengcheng Yuan,Xiaolong Jiang,Baochang Zhang
出处
期刊:Cornell University - arXiv 被引量:1
标识
DOI:10.48550/arxiv.2403.07564
摘要

The intelligent interpretation of buildings plays a significant role in urban planning and management, macroeconomic analysis, population dynamics, etc. Remote sensing image building interpretation primarily encompasses building extraction and change detection. However, current methodologies often treat these two tasks as separate entities, thereby failing to leverage shared knowledge. Moreover, the complexity and diversity of remote sensing image scenes pose additional challenges, as most algorithms are designed to model individual small datasets, thus lacking cross-scene generalization. In this paper, we propose a comprehensive remote sensing image building understanding model, termed RSBuilding, developed from the perspective of the foundation model. RSBuilding is designed to enhance cross-scene generalization and task universality. Specifically, we extract image features based on the prior knowledge of the foundation model and devise a multi-level feature sampler to augment scale information. To unify task representation and integrate image spatiotemporal clues, we introduce a cross-attention decoder with task prompts. Addressing the current shortage of datasets that incorporate annotations for both tasks, we have developed a federated training strategy to facilitate smooth model convergence even when supervision for some tasks is missing, thereby bolstering the complementarity of different tasks. Our model was trained on a dataset comprising up to 245,000 images and validated on multiple building extraction and change detection datasets. The experimental results substantiate that RSBuilding can concurrently handle two structurally distinct tasks and exhibits robust zero-shot generalization capabilities.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
苦行僧完成签到,获得积分10
刚刚
日常卖命完成签到 ,获得积分10
1秒前
独特的孤丹完成签到,获得积分10
1秒前
2秒前
Q1发布了新的文献求助10
2秒前
搬砖狗发布了新的文献求助10
4秒前
彳亍发布了新的文献求助10
4秒前
勤奋幻柏发布了新的文献求助30
5秒前
科研通AI6.3应助鲸落采纳,获得10
5秒前
5秒前
tan发布了新的文献求助10
5秒前
釦沐发布了新的文献求助10
6秒前
无声瀑布完成签到,获得积分10
6秒前
侃侃发布了新的文献求助10
6秒前
瘦瘦的枫叶完成签到 ,获得积分10
7秒前
科研通AI6.3应助Hohai采纳,获得10
7秒前
qiuqiuqiu发布了新的文献求助10
8秒前
9秒前
9秒前
9秒前
www发布了新的文献求助10
9秒前
10秒前
10秒前
熊大发布了新的文献求助10
11秒前
xxxxxxxxx完成签到,获得积分10
12秒前
12秒前
啊嘞嘞完成签到,获得积分10
12秒前
吃一口小羊完成签到,获得积分10
12秒前
夕夜完成签到,获得积分10
12秒前
猪猪侠应助妩媚采纳,获得10
13秒前
14秒前
阿斯顿发布了新的文献求助10
15秒前
15秒前
乐乐应助彳亍采纳,获得10
15秒前
fanyy发布了新的文献求助10
15秒前
16秒前
可靠的夜雪完成签到,获得积分10
16秒前
领导范儿应助奋斗映寒采纳,获得10
17秒前
LBJBowen23发布了新的文献求助10
17秒前
18秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Lewis’s Child and Adolescent Psychiatry: A Comprehensive Textbook Sixth Edition 2000
Continuing Syntax 1000
Encyclopedia of Quaternary Science Reference Work • Third edition • 2025 800
Signals, Systems, and Signal Processing 510
Pharma R&D Annual Review 2026 500
荧光膀胱镜诊治膀胱癌 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6220469
求助须知:如何正确求助?哪些是违规求助? 8045543
关于积分的说明 16771059
捐赠科研通 5306041
什么是DOI,文献DOI怎么找? 2826680
邀请新用户注册赠送积分活动 1804851
关于科研通互助平台的介绍 1664520