Concepts
Computer science
Segmentation
Consistency (knowledge bases)
Artificial intelligence
Normalization (linguistics)
Encoding (set theory)
Representation (politics)
Feature (linguistics)
Feature learning
Deep neural network
Grounding (evidence)
Machine learning
Computer vision
Deep learning
Politics
Political science
Law
History
Programming language
Linguistics
Philosophy
Set (abstract data type)
Archaeology
Authors
Runnan Chen, Youquan Liu, Lingdong Kong, Nenglun Chen, Xinge Zhu, Yuexin Ma, Tongliang Liu, Wenping Wang
Source
Journal: Cornell University - arXiv
Date: 2023-01-01
Cited by: 5
Identifiers
DOI: 10.48550/arxiv.2306.03899
Abstract
Vision foundation models such as Contrastive Vision-Language Pre-training (CLIP) and Segment Anything (SAM) have demonstrated impressive zero-shot performance on image classification and segmentation tasks. However, the incorporation of CLIP and SAM for label-free scene understanding has yet to be explored. In this paper, we investigate the potential of vision foundation models in enabling networks to comprehend 2D and 3D worlds without labelled data. The primary challenge lies in effectively supervising networks under extremely noisy pseudo labels, which are generated by CLIP and further exacerbated during the propagation from the 2D to the 3D domain. To tackle these challenges, we propose a novel Cross-modality Noisy Supervision (CNS) method that leverages the strengths of CLIP and SAM to supervise 2D and 3D networks simultaneously. In particular, we introduce a prediction consistency regularization to co-train 2D and 3D networks, then further impose the networks' latent space consistency using SAM's robust feature representation. Experiments conducted on diverse indoor and outdoor datasets demonstrate the superior performance of our method in understanding 2D and 3D open environments. Our 2D and 3D networks achieve label-free semantic segmentation with 28.4% and 33.5% mIoU on ScanNet, improving by 4.7% and 7.9%, respectively. For the nuImages and nuScenes datasets, the performance is 22.1% and 26.8%, with improvements of 3.5% and 6.0%, respectively. Code is available at https://github.com/runnanchen/Label-Free-Scene-Understanding.
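The abstract names two training signals: a prediction consistency regularization that co-trains the 2D and 3D networks, and a latent-space consistency term anchored on SAM's features. Below is a minimal sketch of how such losses could be written, not the authors' released implementation: the function name, tensor layout, the pixel-to-point correspondence index, and the choice of KL divergence and cosine similarity are all assumptions made here for illustration.

    # Illustrative sketch only -- assumed shapes and loss choices, not the CNS code.
    import torch
    import torch.nn.functional as F

    def cns_losses(logits_2d, logits_3d, feat_2d, feat_3d, sam_feat, pix2point):
        """Hypothetical cross-modality consistency losses.

        logits_2d:  (P, C) class logits of the 2D network for pixels that project onto the point cloud
        logits_3d:  (N, C) class logits of the 3D network for all points
        feat_2d:    (P, D) latent features of the 2D network for those pixels
        feat_3d:    (N, D) latent features of the 3D network
        sam_feat:   (P, D) SAM features for the same pixels (assumed pre-extracted and projected to D)
        pix2point:  (P,)   index of the 3D point each pixel corresponds to
        """
        # 1) Prediction consistency: make the 2D and 3D class distributions agree
        #    on corresponding pixel/point pairs (co-training signal).
        log_p2d = F.log_softmax(logits_2d, dim=-1)
        p3d = F.softmax(logits_3d[pix2point], dim=-1)
        loss_pred = F.kl_div(log_p2d, p3d, reduction="batchmean")

        # 2) Latent-space consistency: pull both networks' features toward SAM's
        #    representation, here via cosine similarity.
        sam = F.normalize(sam_feat, dim=-1)
        loss_latent = (
            (1 - F.cosine_similarity(F.normalize(feat_2d, dim=-1), sam, dim=-1)).mean()
            + (1 - F.cosine_similarity(F.normalize(feat_3d[pix2point], dim=-1), sam, dim=-1)).mean()
        )
        return loss_pred + loss_latent

In practice the 2D and 3D feature dimensions would likely differ from SAM's, so a small projection head on each branch (not shown) would be needed before the cosine term; the actual weighting between the two losses is also unspecified here.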