亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整地填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

CAST: Component-Aligned 3D Scene Reconstruction from an RGB Image

计算机视觉 人工智能 组分(热力学) 计算机图形学(图像) 计算机科学 图像(数学) RGB颜色模型 三维重建 迭代重建 热力学 物理
作者
Kaixin Yao,Xu Yan,Yan Zeng,Qixuan Zhang,Lan Xu,Wei Yang,Jiayuan Gu,Jingyi Yu
出处
期刊:ACM Transactions on Graphics [Association for Computing Machinery]
卷期号:44 (4): 1-19 被引量:1
标识
DOI:10.1145/3730841
摘要

Recovering high-quality 3D scenes from a single RGB image is a challenging task in computer graphics. Current methods often struggle with domain-specific limitations or low-quality object generation. To address these, we propose CAST (Component-Aligned 3D Scene Reconstruction from a Single RGB Image), a novel method for 3D scene reconstruction. CAST starts by extracting object-level 2D segmentation and relative depth information from the input image, followed by using a GPT-based model to analyze inter-object spatial relations. This enables understanding of how objects relate to each other within the scene, ensuring more coherent reconstruction. CAST then employs an occlusion-aware large-scale 3D generation model to independently generate each object's full geometry, using Masked Auto Encoder (MAE) and point cloud conditioning to mitigate the effects of occlusions and partial object information, ensuring accurate alignment with the source image's geometry and texture. To align each object with the scene, the alignment generation model computes the necessary transformations, allowing the generated meshes to be accurately placed and integrated into the scene's point cloud. Finally, CAST applies a physics-aware correction mechanism, which leverages a fine-grained relation graph to generate a constraint graph. This graph guides the optimization of object poses, ensuring physical consistency and spatial coherence. By utilizing Signed Distance Fields (SDF), the model effectively addresses issues such as occlusions, object penetration, and floating objects, ensuring that the generated scene accurately reflects real-world physical interactions. Experimental results demonstrate that CAST significantly improves the quality of single-image 3D scene reconstruction, offering enhanced realism and accuracy in scene understanding and reconstruction tasks. CAST has practical applications in virtual content creation, such as immersive game environments and film production, where real-world setups can be seamlessly integrated into virtual landscapes. Additionally, CAST can be leveraged in robotics, enabling efficient real-to-simulation workflows and providing realistic, scalable simulation environments for robotic systems.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
16秒前
shishi发布了新的文献求助10
23秒前
柯语雪完成签到 ,获得积分10
30秒前
38秒前
sala发布了新的文献求助10
43秒前
48秒前
科研兄发布了新的文献求助10
48秒前
50秒前
科目三应助轻语采纳,获得10
50秒前
53秒前
sala完成签到,获得积分20
55秒前
九局下半发布了新的文献求助10
56秒前
56秒前
57秒前
传奇3应助科研兄采纳,获得10
57秒前
嘟呜发布了新的文献求助10
59秒前
万能图书馆应助MM采纳,获得10
1分钟前
布叻发布了新的文献求助10
1分钟前
轻语发布了新的文献求助10
1分钟前
大模型应助科研通管家采纳,获得10
1分钟前
领导范儿应助科研通管家采纳,获得10
1分钟前
充电宝应助野性的飞绿采纳,获得30
1分钟前
龙江阿祖完成签到,获得积分10
1分钟前
脆皮黑巧完成签到 ,获得积分10
1分钟前
科研兄完成签到,获得积分10
1分钟前
1分钟前
1分钟前
丘比特应助独特的鹅采纳,获得10
1分钟前
1分钟前
1分钟前
LI发布了新的文献求助10
1分钟前
优雅的大白菜完成签到 ,获得积分10
1分钟前
chen完成签到,获得积分10
1分钟前
1分钟前
我小怂怂006完成签到 ,获得积分10
1分钟前
LI完成签到,获得积分10
2分钟前
2分钟前
2分钟前
2分钟前
uuuuuuu发布了新的文献求助10
2分钟前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
The Graphene Handbook (2019 Edition) 800
Adhesion Science: Principles & Practice 800
Signals, Systems, and Signal Processing 610
IEST-RP-CC018: Cleanroom Cleaning and Sanitization: Operating and Monitoring Procedures 600
Fundamentals of Pharmaceutical and Biologics Regulations: A Global Perspective, Second Edition 600
久松真一著作集〈第5巻〉禅と芸術 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6534688
求助须知:如何正确求助?哪些是违规求助? 8327828
关于积分的说明 17839660
捐赠科研通 5636174
什么是DOI,文献DOI怎么找? 2934469
邀请新用户注册赠送积分活动 1910752
关于科研通互助平台的介绍 1769202