Computer science
Transformer
Grounding
Modal
Remote sensing
Scale (ratio)
Computer vision
Artificial intelligence
Voltage
Geology
Electrical engineering
Engineering
Geography
Chemistry
Cartography
Polymer chemistry
Authors
Meng Lan, Fu Rong, Hongzan Jiao, Zhi Gao, Lefei Zhang
Identifier
DOI: 10.1109/TGRS.2024.3407598
Abstract
Visual grounding for remote sensing images (RSVG) aims to localize the objects referred to by a language expression in remote sensing (RS) images. Existing methods tend to align visual and text features, concatenate them, and then employ a fusion Transformer to learn a token representation for final target localization. However, this simple fusion Transformer structure fails to sufficiently learn the location representation of the referred object from the multi-modal features. Inspired by the detection Transformer, in this paper we propose a novel language-query-based Transformer framework for RSVG, termed LQVG. Specifically, we adopt the extracted sentence-level text features as queries, called language queries, to retrieve and aggregate representation information about the referred object from the multi-scale visual features in the Transformer decoder. The language queries are then converted into object embeddings for the final coordinate prediction of the referred object. In addition, a multi-scale cross-modal alignment module is devised before the multi-modal Transformer to enhance the semantic correlation between the visual and text features, thus helping the cross-modal decoding process generate a more precise object representation. Moreover, a new RSVG dataset named RSVG-HR is built to evaluate the performance of RSVG approaches on very high-resolution remote sensing images with inconspicuous objects. Experimental results on two benchmark datasets demonstrate that the proposed method significantly surpasses the compared methods and achieves state-of-the-art performance. The dataset and code are available at https://github.com/LANMNG/LQVG.
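To make the language-query decoding idea concrete, the following is a minimal PyTorch sketch, not the authors' implementation (see the linked repository for that): a sentence-level text embedding serves as the single decoder query attending over flattened multi-scale visual features, and the decoded object embedding is mapped to box coordinates. The module names, dimensions, box head, and the omission of the multi-scale cross-modal alignment module are all assumptions made for illustration.

import torch
import torch.nn as nn

class LanguageQueryDecoder(nn.Module):
    """Sketch: a sentence-level text feature acts as the 'language query'
    that retrieves object information from multi-scale visual features."""
    def __init__(self, d_model=256, nhead=8, num_layers=6):
        super().__init__()
        layer = nn.TransformerDecoderLayer(d_model, nhead, batch_first=True)
        self.decoder = nn.TransformerDecoder(layer, num_layers)
        # Assumed prediction head: object embedding -> (cx, cy, w, h) in [0, 1].
        self.box_head = nn.Sequential(
            nn.Linear(d_model, d_model), nn.ReLU(),
            nn.Linear(d_model, 4), nn.Sigmoid(),
        )

    def forward(self, sentence_feat, visual_feats):
        # sentence_feat: (B, d_model) sentence-level text embedding.
        # visual_feats: list of (B, H_i * W_i, d_model) feature maps,
        # assumed already projected to d_model and flattened per scale.
        memory = torch.cat(visual_feats, dim=1)      # (B, sum HW, d)
        query = sentence_feat.unsqueeze(1)           # (B, 1, d) language query
        obj_embed = self.decoder(query, memory)      # (B, 1, d) object embedding
        return self.box_head(obj_embed.squeeze(1))   # (B, 4) predicted box

# Toy usage with random tensors standing in for backbone outputs.
dec = LanguageQueryDecoder()
txt = torch.randn(2, 256)                            # hypothetical text encoder output
vis = [torch.randn(2, n, 256) for n in (400, 100, 25)]  # three feature scales
print(dec(txt, vis).shape)                           # torch.Size([2, 4])

Using the sentence embedding itself as the query, rather than learned object queries as in the detection Transformer, is what ties the decoded embedding to the referred object; a real implementation would add cross-modal alignment before decoding and a box regression loss during training.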