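Computer science
RGB color model
Benchmark (surveying)
Artificial intelligence
Computer vision
Semantics (computer science)
Fidelity
Scale (ratio)
Point cloud
Computer graphics (images)
Geography
Cartography
Telecommunications
Programming language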
Authors
Chandan Yeshwanth, Yueh-Cheng Liu, Matthias Nießner, Angela Dai
Identifier
DOI:10.1109/iccv51070.2023.00008
Abstract
We present ScanNet++, a large-scale dataset that couples capture of high-quality and commodity-level geometry and color of indoor scenes. Each scene is captured with a high-end laser scanner at sub-millimeter resolution, along with registered 33-megapixel images from a DSLR camera, and RGB-D streams from an iPhone. Scene reconstructions are further annotated with an open vocabulary of semantics, with label-ambiguous scenarios explicitly annotated for comprehensive semantic understanding. ScanNet++ enables a new real-world benchmark for novel view synthesis, both from high-quality RGB capture, and importantly also from commodity-level images, in addition to a new benchmark for 3D semantic scene understanding that comprehensively encapsulates diverse and ambiguous semantic labeling scenarios. Currently, ScanNet++ contains 460 scenes, 280,000 captured DSLR images, and over 3.7M iPhone RGB-D frames.