Computer vision
Object (grammar)
Image (mathematics)
Computer science
Artificial intelligence
Computer graphics (images)
Authors
Dmitry Tochilkin, David Pankratz, Zexiang Liu, Zixuan Huang, Adam Letts, Yangguang Li, Liang Ding, Christian Laforte, Varun Jampani, Yan-Pei Cao
Source
Journal: Cornell University - arXiv
Date: 2024-03-04
Citations: 10
Identifier
DOI: 10.48550/arxiv.2403.02151
Abstract
This technical report introduces TripoSR, a 3D reconstruction model leveraging transformer architecture for fast feed-forward 3D generation, producing 3D mesh from a single image in under 0.5 seconds. Building upon the LRM network architecture, TripoSR integrates substantial improvements in data processing, model design, and training techniques. Evaluations on public datasets show that TripoSR exhibits superior performance, both quantitatively and qualitatively, compared to other open-source alternatives. Released under the MIT license, TripoSR is intended to empower researchers, developers, and creatives with the latest advancements in 3D generative AI.