人工智能
计算机科学
RGB颜色模型
面部识别系统
面子(社会学概念)
模式识别(心理学)
特征(语言学)
分割
计算机视觉
三维人脸识别
特征提取
人脸检测
社会科学
语言学
哲学
社会学
作者
Meng-Tzu Chiu,Hsun-Ying Cheng,Chien-Yi Wang,Shang‐Hong Lai
出处
期刊:IEEE transactions on biometrics, behavior, and identity science
[Institute of Electrical and Electronics Engineers]
日期:2023-01-09
卷期号:5 (3): 334-347
被引量:5
标识
DOI:10.1109/tbiom.2022.3233769
摘要
Deep learning approaches achieve highly accurate face recognition by training the models with huge face image datasets. Unlike 2D face image datasets, there is a lack of large 3D face datasets available to the public. Existing public 3D face datasets were usually collected with few subjects, leading to the over-fitting problem. This paper proposes two CNN models to improve the RGB-D face recognition task. The first is a segmentation-aware depth estimation network, called DepthNet, which estimates depth maps from RGB face images by exploiting semantic segmentation for more accurate face region localization. The other is a novel segmentation-guided RGB-D face recognition model that contains an RGB recognition branch, a depth map recognition branch, and an auxiliary segmentation mask branch. In our multi-modality face recognition model, a feature disentanglement scheme is employed to factorize the feature representation into identity-related and style-related components. DepthNet is applied to augment a large 2D face image dataset to a large RGB-D face dataset, which is used for training our RGB-D face recognition model. Our experimental results show that DepthNet can produce more reliable depth maps from face images with the segmentation mask. Our multi-modality face recognition model fully exploits the depth map and outperforms state-of-the-art methods on several public 3D face datasets with challenging variations.
科研通智能强力驱动
Strongly Powered by AbleSci AI