人工智能
计算机科学
单眼
稳健性(进化)
计算机视觉
姿势
杠杆(统计)
旋转(数学)
一般化
标记数据
一致性(知识库)
模式识别(心理学)
数学
数学分析
化学
基因
生物化学
作者
Daiheng Gao,Xindi Zhang,Xingyu Chen,Andong Tan,Bang Zhang,Pan Pan,Ping Tan
标识
DOI:10.1145/3503161.3547828
摘要
Current methods for 3D hand pose estimation fail to generalize well to in-the-wild new scenarios due to varying camera viewpoints, self-occlusions, and complex environments. To address this problem, we propose CycleHand to improve the generalization ability of the model in a self-supervised manner. Our motivation is based on an observation: if one globally rotates the whole hand and reversely rotates it back, the estimated 3D poses of fingers should keep consistent before and after the rotation because the wrist-relative hand poses stay unchanged during global 3D rotation. Hence, we propose arbitrary-rotation self-supervised consistency learning to improve the model's robustness for varying viewpoints. Another innovation of CycleHand is that we propose a high-fidelity texture map to render the photorealistic rotated hand with different lighting conditions, backgrounds, and skin tones to further enhance the effectiveness of our self-supervised task. To reduce the potential negative effects brought by the domain shift of synthetic images, we use the idea of contrastive learning to learn a synthetic-real consistent feature extractor in extracting domain-irrelevant hand representations. Experiments show that CycleHand can largely improve the hand pose estimation performance in both canonical datasets and real-world applications.
科研通智能强力驱动
Strongly Powered by AbleSci AI