计算机科学
人工智能
卷积神经网络
骨干网
目标检测
管道(软件)
相似性(几何)
模式识别(心理学)
姿势
计算机视觉
钥匙(锁)
图像(数学)
计算机网络
计算机安全
程序设计语言
作者
Yuan Li,Xinggang Wang,Wenyu Liu,Bin Feng
标识
DOI:10.1109/tcsvt.2019.2912620
摘要
This paper presents an effective single network for hand keypoint detection, instead of relying on the frequently-used two-stage pipeline consisting of localizing the hand and detecting the key points. Our method trains a fully convolutional neural network in an end-to-end manner, based on a novelly proposed pose anchor network, which can be deemed as an extension of the region proposal network (RPN) in Faster Region-based convolutional network. Moreover, we generate our pose anchor in a data-driven way, i.e., a K-means cluster algorithm based on object keypoint similarity (OKS), instead of manually design. In this way, we can obtain multiple representative pose anchors with various gestures, angles, and scales. By introducing the pose anchor, we are capable of utilizing the prior knowledge of the hand structure, mitigating the problem of occlusion to some extent. We demonstrate the feasibility and effectiveness of our method with extensive experiments on the challenging large-scale multiview 3D hand pose dataset (LSM-HPD) and New Zealand Sign Language Dataset (NZSL).
科研通智能强力驱动
Strongly Powered by AbleSci AI