计算机科学
人工智能
平滑度
图像(数学)
计算机视觉
集成学习
模式识别(心理学)
数学
数学分析
作者
Jingze Feng,Chong He,Guorui Wang,Meili Wang
摘要
Abstract In computer vision, the task of 3D reconstruction from monocular sparse images poses significant challenges, particularly in the field of animal modelling. The diverse morphology of animals, their varied postures, and the variable conditions of image acquisition significantly complicate the task of accurately reconstructing their 3D shape and pose from a monocular image. To address these complexities, we propose S‐LASSIE, a novel technique for 3D reconstruction of quadrupeds from monocular sparse images. It requires only 10–30 images of similar breeds for training. To effectively mitigate depth ambiguities inherent in monocular reconstructions, S‐LASSIE employs a multi‐angle projection loss function. In addition, our approach, which involves fusion and smoothing of bone structures, resolves issues related to disjointed topological structures and uneven connections at junctions, resulting in 3D models with comprehensive topologies and improved visual fidelity. Our extensive experiments on the Pascal‐Part and LASSIE datasets demonstrate significant improvements in keypoint transfer, overall 2D IOU and visual quality, with an average keypoint transfer and overall 2D IOU of 59.6% and 86.3%, respectively, which are superior to existing techniques in the field.
科研通智能强力驱动
Strongly Powered by AbleSci AI