发布文献求助

已入深夜，您辛苦了！由于当前在线用户较少，发布求助请尽量完整地填写文献信息，科研通机器人24小时在线，伴您度过漫漫科研夜！祝你早点完成任务，早点休息，好梦！

VisualRAG: Knowledge-Guided Retrieval Augmentation for Image-Text Matching

计算机科学图像检索人工智能计算机视觉图像匹配图像（数学）匹配（统计）模式识别（心理学）情报检索数学统计

作者

Hengchang Wang,Li Liu,Huaxiang Zhang,Lei Zhu,Xiaojun Chang,Hao Du

出处

期刊：IEEE Transactions on Circuits and Systems for Video Technology [Institute of Electrical and Electronics Engineers]
日期：2025-08-08 卷期号：36 (1): 1234-1248 被引量：1

标识

DOI：10.1109/tcsvt.2025.3597097

摘要

Image-text matching as a fundamental cross-modal understanding task presents unique challenges in weakly-aligned scenarios. Such data typically feature highly abstract textual captions with sparse entity references, creating a significant semantic gap with visual content. Current mainstream methods, primarily designed for strongly aligned data pairs, employ dynamic modeling or multi-dimensional similarity computation to achieve feature space mapping. However, they struggle with information asymmetry and modal heterogeneity in weakly aligned cases. To address this, we propose a Visual Perception Knowledge Enhancement (VPKE) framework. Unlike existing methods based on strong alignment assumptions, this framework mines latent image semantics through vision-language models and generates auxiliary captions, overcoming the information bottleneck of traditional text modalities. Its core innovation lies in an adaptive knowledge distillation mechanism that combines retrieval-augmented generation (RAG) with key entity extraction. This mechanism effectively filters noise when introducing external knowledge while optimizing cross-modal feature integration. The framework employs multi-level similarity evaluation to dynamically adjust fusion weights among original text, key entities, and auxiliary captions, enabling adaptive integration of diverse semantic features and significantly improving model flexibility. Additionally, multi-scale feature extraction further enhances cross-modal representation capabilities. Experimental results show that the proposed method performs excellently in image-text retrieval tasks on the MSCOCO and Flickr30K datasets, validating its effectiveness.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

更新

📰 新增『新锐期刊分区』 (2026-3-24)

更新

💬 新增更精细的自定义提醒设置 (2026-1-4)

新增

🕒 每天60秒读懂世界·精选全球要闻 (2026-1-2)

新增

PDF的下载单位、IP信息已删除 (2025-6-4)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 大个的应助被小轩窗zst采纳，获得10

2秒前; shark发布了新的文献求助10

4秒前; 积极的康宝驳回了斯文败类的应助

6秒前; wanci上传了应助文件

7秒前; 隐形曼青上传了应助文件

9秒前; 李健上传了应助文件

9秒前; 完美世界的应助被罗媛采纳，获得10

9秒前; 希望天下0贩的0的应助被freddy采纳，获得10

10秒前; 迷你的延恶发布了新的文献求助30

11秒前; 乐乐上传了应助文件

11秒前; 坚定的奇异果完成签到，获得积分10

11秒前; 万能图书馆上传了应助文件

12秒前; 万能图书馆上传了应助文件

12秒前; 麻麻珍妮斯发布了新的文献求助10

13秒前; Seagull发布了新的文献求助10

14秒前; cat发布了新的文献求助10

15秒前; Lexie发布了新的文献求助10

16秒前; wuwen发布了新的文献求助10

16秒前; 之尔完成签到，获得积分10

16秒前; JamesPei的应助被iligll采纳，获得10

19秒前; 无极微光上传了应助文件

19秒前; 拾玖发布了新的文献求助10

20秒前; 上官若男上传了应助文件

20秒前; najd完成签到，获得积分10

21秒前; SciGPT上传了应助文件

21秒前; 彭于晏上传了应助文件

22秒前; 张大点完成签到，获得积分20

23秒前; 坚定的若枫发布了新的文献求助10

24秒前; 烟花上传了应助文件

25秒前; 研友_VZG7GZ上传了应助文件

25秒前; 曾祥钰发布了新的文献求助10

25秒前; 桐桐的应助被cat采纳，获得10

25秒前; 顺利八宝粥发布了新的文献求助10

25秒前; 领导范儿的应助被缥缈静珊采纳，获得10

26秒前; Tiejian发布了新的文献求助10

26秒前; 大个上传了应助文件

28秒前; Sandy完成签到，获得积分10

28秒前; liufool发布了新的文献求助10

29秒前; pepe发布了新的文献求助10

30秒前; 大模型上传了应助文件

31秒前

高分求助中: (应助此贴封号)【重要！！请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000; The Graphene Handbook (2019 Edition) 800; IEST-RP-CC018: Cleanroom Cleaning and Sanitization: Operating and Monitoring Procedures 600; Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600; 久松真一著作集〈第5巻〉禅と芸術 500; Fundamentals of Modern Mathematics: A Practical Review (Dover Books on Mathematics) 500; Cold War Transcended: Australia's China Policy, 1949-1990 470

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 6587273; 求助须知：如何正确求助？哪些是违规求助？ 8360749; 关于积分的说明 17903188; 捐赠科研通 5730663; 什么是DOI，文献DOI怎么找？ 2950165; 邀请新用户注册赠送积分活动 1925626; 关于科研通互助平台的介绍 1813061

今日热心研友

AllRightReserved

殷勤的紫槐

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2026 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：821889395【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通