SED-DETR: A Scale-Enhanced Deformable Detection Transformer for Remote Sensing Images

遥感 计算机科学 比例(比率) 计算机视觉 人工智能 地质学 地图学 地理
作者
Haitao Yin,Zhuyun Zhu,He Wang
出处
期刊:IEEE Transactions on Geoscience and Remote Sensing [Institute of Electrical and Electronics Engineers]
卷期号:63: 1-12 被引量:9
标识
DOI:10.1109/tgrs.2025.3571033
摘要

Detection Transformer (DETR) has emerged as a highly promising approach in object detection and has attracted significant interest. However, most DETR-like methods cannot simultaneously leverage the shape and scale priors for attention calculation, resulting in limited performance in detecting remote sensing objects with diverse shapes and scales. To address this issue, this article proposes a Scale-Enhanced Deformable DETR (SED-DETR) for remote sensing object detection (RSOD). The core component of SED-DETR is Scale-Enhanced Deformable Attention (SEDA), which is designed based on the principles of deformable shape and dynamic scale. Specifically, the SEDA module utilizes multi-scale attention heads. First, conventional multiple attention heads are consolidated into several scale-heads through an adaptive scale aggregation approach, which dynamically adjusts the distributions of different scales to enhance the scale-aware modeling ability. For each scale-head, dilated sampling is applied at a specific dilation rate to capture multi-scale receptive fields. The sampled positions are further refined by learnable offsets predicted from query features, enabling a deformable dilated mechanism for fine-grained feature extraction of multi-scale instances. Finally, we adopt the mixed query selection and the denoising training defined in DINO to implement SED-DETR. Experimental results on the xView, DIOR, NWPU VHR-10 and COCO datasets demonstrate that SED-DETR outperforms state-of-the-art DETR-like methods. Specifically, SED-DETR achieves 5.6%, 10.9%, and 8.6% mAP gains over the baseline Deformable DETR on the xView, DIOR, and NWPU VHR-10 datasets, respectively. The source code is available at https://github.com/zzy599/SEDDETR.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
正直的孤丝完成签到,获得积分10
1秒前
ccccl完成签到,获得积分10
1秒前
新新辛欣完成签到,获得积分10
1秒前
Mao完成签到,获得积分0
2秒前
2秒前
温暖白亦完成签到,获得积分10
2秒前
李薇发布了新的文献求助10
3秒前
3秒前
3秒前
假装有昵称完成签到,获得积分10
4秒前
迷路睫毛完成签到,获得积分10
4秒前
缥缈的绿兰完成签到,获得积分10
4秒前
leodu完成签到,获得积分10
4秒前
小王发布了新的文献求助10
5秒前
沐风完成签到,获得积分10
5秒前
情怀应助大蛋采纳,获得10
6秒前
FD完成签到,获得积分10
6秒前
6秒前
蔷薇完成签到 ,获得积分10
7秒前
科研通AI2S应助zlsn采纳,获得10
7秒前
xinxinfenghuo完成签到,获得积分10
7秒前
laowang完成签到,获得积分10
7秒前
韩十四完成签到,获得积分10
7秒前
scinature发布了新的文献求助10
7秒前
孙兆杰发布了新的文献求助10
8秒前
小林子完成签到,获得积分10
8秒前
8秒前
nannan完成签到,获得积分10
8秒前
不喜欢孜然完成签到,获得积分10
9秒前
激情的冰绿完成签到 ,获得积分10
9秒前
疯狂加载ing给xieyue的求助进行了留言
9秒前
LALALA卫卫J完成签到,获得积分10
9秒前
11秒前
11秒前
11秒前
huang应助小林子采纳,获得10
12秒前
13秒前
火星上立果完成签到,获得积分10
13秒前
糯米糕发布了新的文献求助10
14秒前
高分求助中
Principles of Economics, 11th Edition 10000
Prescott's Microbiology: 2026 Release ISE 10000
University Physics with Modern Physics, 16th edition 10000
Cronologia da história de Macau 5000
Merrill's Atlas of Radiographic Positioning and Procedures - 3-Volume Set, 16th Edition 2000
Interactions of Vowel Quality and Prosody in East Slavic 1000
Erwählung und Berufung bei Paulus: Bedeutung, Entwicklung und Funktion einer Vorstellung in ihrem frühjüdischen und griechisch-römischen Kontext 850
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 内科学 物理 复合材料 催化作用 细胞生物学 无机化学 光电子学 物理化学 电极 基因
热门帖子
关注 科研通微信公众号,转发送积分 7148062
求助须知:如何正确求助?哪些是违规求助? 8794456
关于积分的说明 18585779
捐赠科研通 6743530
什么是DOI,文献DOI怎么找? 3158531
关于科研通互助平台的介绍 2289996
邀请新用户注册赠送积分活动 2132937