发布文献求助

亲爱的研友该休息了！由于当前在线用户较少，发布求助请尽量完整的填写文献信息，科研通机器人24小时在线，伴您度过漫漫科研夜！身体可是革命的本钱，早点休息，好梦！

CLIP-VG: Self-Paced Curriculum Adapting of CLIP for Visual Grounding

计算机科学边距（机器学习）编码（集合论）人工智能源代码测距语言模型机器学习模式识别（心理学）自然语言处理程序设计语言集合（抽象数据类型）电信

作者

Lin Xiao,Xiaoshan Yang,Fang Peng,Ming Yan,Yaowei Wang,Changsheng Xu

出处

期刊：IEEE Transactions on Multimedia [Institute of Electrical and Electronics Engineers]
日期：2024-01-01 卷期号：26: 4334-4347

链接

arxiv.org arxiv.orgdoi.org

标识

DOI：10.1109/tmm.2023.3321501

摘要

Visual Grounding (VG) is a crucial topic in the field of vision and language, which involves locating a specific region described by expressions within an image.To reduce the reliance on manually labeled data, unsupervised visual grounding have been developed to locate regions using pseudo-labels.However, the performance of existing unsupervised methods is highly dependent on the quality of pseudo-labels and these methods always encounter issues with limited diversity.In order to utilize vision and language pre-trained models to address the grounding problem, and reasonably take advantage of pseudo-labels, we propose CLIP-VG, a novel method that can conduct self-paced curriculum adapting of CLIP with pseudo-language labels.We propose a simple yet efficient end-to-end network architecture to realize the transfer of CLIP to the visual grounding.Based on the CLIP-based architecture, we further propose single-source and multi-source curriculum adapting algorithms, which can progressively find more reliable pseudo-labels to learn an optimal model, thereby achieving a balance between reliability and diversity for the pseudo-language labels.Our method outperforms the current state-of-the-art unsupervised method by a significant margin on RefCOCO/+/g datasets in both single-source and multi-source scenarios, with improvements ranging from 6.78% to 10.67% and 11.39% to 14.87%, respectively.The results even outperform existing weakly supervised methods.Furthermore, our method is also competitive in fully supervised setting.The code and models are available at https://github.com/linhuixiao/CLIP-VG.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 历史今天

活动

『应助活动周』获奖名单已公布 🔥 (2025-4-2)

更新

『中科院2025期刊分区』已更新 (2025-3-23)

更新

『即时热点』模块已上线 (2025-2-28)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: 科研通AI2S上传了应助文件

2秒前; Lucas上传了应助文件

3秒前; orixero上传了应助文件

3秒前; cdercder上传了应助文件

6秒前; Hao发布了新的文献求助10

7秒前; Carrido发布了新的文献求助10

8秒前; 在水一方的应助被Hao采纳，获得10

13秒前; 丘比特的应助被Carrido采纳，获得10

25秒前; 希望天下0贩的0的应助被陈陈采纳，获得10

27秒前; 过氧化氢上传了应助文件

29秒前; orixero的应助被白白拜拜采纳，获得10

33秒前; Michael的应助被科研通管家采纳，获得20

35秒前; 科研通AI2S的应助被科研通管家采纳，获得10

35秒前; 希望天下0贩的0上传了应助文件

36秒前; 丘比特上传了应助文件

38秒前; 陈陈发布了新的文献求助10

39秒前; Carrido发布了新的文献求助10

43秒前; iebdus123完成签到，获得积分10

1分钟前; wanci的应助被Carrido采纳，获得10

1分钟前; wanci上传了应助文件

1分钟前; 科研通AI5上传了应助文件

1分钟前; 遇上就这样吧上传了应助文件

1分钟前; Carrido发布了新的文献求助10

1分钟前; 科研通AI5的应助被调皮帆布鞋采纳，获得10

1分钟前; 十四发布了新的文献求助10

1分钟前; NexusExplorer的应助被Carrido采纳，获得10

1分钟前; 豆豆哥完成签到，获得积分10

1分钟前; FashionBoy的应助被十四采纳，获得10

1分钟前; 眯眯眼的黎昕完成签到，获得积分10

1分钟前; NexusExplorer上传了应助文件

2分钟前; Carrido发布了新的文献求助10

2分钟前; 宋亚佩完成签到，获得积分10

2分钟前; 深情安青的应助被Carrido采纳，获得10

2分钟前; oleskarabach发布了新的文献求助10

2分钟前; 不冻泉的水驳回了共享精神的应助

2分钟前; Sunsets发布了新的文献求助10

2分钟前; mmyhn完成签到，获得积分10

2分钟前; 深情安青上传了应助文件

2分钟前; bkagyin的应助被科研通管家采纳，获得10

2分钟前; hank完成签到，获得积分10

2分钟前

高分求助中: Applied Survey Data Analysis (第三版, 2025) 800; Assessing and Diagnosing Young Children with Neurodevelopmental Disorders (2nd Edition) 700; The Elgar Companion to Consumer Behaviour and the Sustainable Development Goals 540; Images that translate 500; Handbook of Innovations in Political Psychology 400; Mapping the Stars: Celebrity, Metonymy, and the Networked Politics of Identity 400; Towards a spatial history of contemporary art in China 300

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 3843203; 求助须知：如何正确求助？哪些是违规求助？ 3385459; 关于积分的说明 10540518; 捐赠科研通 3106021; 什么是DOI，文献DOI怎么找？ 1710846; 邀请新用户注册赠送积分活动 823778; 科研通“疑难数据库（出版商）”最低求助积分说明 774264

今日热心研友

无语的安白

小茄子爷爷

遇上就这样吧

今天只做一件事

熬夜的小王

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2025 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助，群号：941272744【点击一键加群】

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通