计算机科学
先验概率
培训(气象学)
点云
云计算
人工智能
点(几何)
图像(数学)
计算机视觉
计算机图形学(图像)
数学
贝叶斯概率
几何学
操作系统
物理
气象学
作者
Zhengyu Li,Yao Wu,Yanyun Qu
标识
DOI:10.1007/978-981-99-8429-9_11
摘要
Self-Supervised Learning (SSL) is a viable technique to unleash the scalability and generalization of the network. Nevertheless, the representations learned by existing 3D SSL are still insufficient for point cloud shape analysis. Compared to point clouds, images have better strength in providing fine-grained semantic information. In this paper, we propose a framework of cross-modal SSL, enabling knowledge transfer across modalities. Specifically, two self-supervised tasks are meticulously designed. The former constructs instance-level consistency, which computes the similarity between objects to discriminate instances of different modalities. The latter constructs cluster-level consistency, which mines category coherence by grouping point clouds and corresponding images into the same semantic regions. By jointly learning feature representations and cluster assignments, the model incorporates the latent category information to reduce the intra-cluster variance as well as increase the inter-cluster variance. Extensive experimental results demonstrate the effectiveness on three representative datasets.
科研通智能强力驱动
Strongly Powered by AbleSci AI