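Computer science
Artificial intelligence
Pattern recognition (psychology)
Convolutional neural network
Gesture recognition
Gesture
Residual
Skeleton (computer programming)
RGB color model
Classifier (UML)
Optics (focus)
Computer vision
Algorithm
Physics
Optics
Programming language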
Authors
Chi Lin,Jun Wan,Yanyan Liang,Stan Z. Li
Identifier
DOI:10.1109/fg.2018.00018
Abstract
In this paper, we focus on large-scale isolated gesture recognition in RGB-D videos. We develop a novel ensemble method that explores deep spatio-temporal features using 3D Convolutional Neural Networks (CNNs) with a residual architecture (Res-C3D) and builds a time-series model from skeleton information using a Long Short-Term Memory (LSTM) network. First, the relative positions and angles of different keypoints are extracted and used to build the time-series model in the LSTM. Second, because gestures derive mainly from arm and hand movements, we use the skeleton information (keypoints) of the body to retain the arm regions and discard the other parts; the resulting masked Res-C3D reduces the effect of the background and other variations. Moreover, instead of using fixed weights, the ensemble model obtains the weight of each voting sub-classifier adaptively through training, since each sub-classifier has an advantage on certain classes. Our experimental results show that the proposed method achieves state-of-the-art performance, with an accuracy of 0.6842 on the IsoGD dataset.
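The two skeleton-related ideas in the abstract (relative positions and angles of keypoints as per-frame features, and per-class weights for the voting sub-classifiers) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the function names `skeleton_features` and `fuse_scores`, the choice of root joint, the joint triplets, and the weight values are all hypothetical, and the weights are passed in as given rather than learned by training as in the paper. The LSTM and Res-C3D models themselves are omitted.

```python
import numpy as np

def skeleton_features(keypoints, root=0, triplets=((0, 1, 2),)):
    """Per-frame features from 2D keypoints (illustrative assumption, not the paper's exact features).

    keypoints: array of shape (T, K, 2), T frames and K keypoints.
    root:      index of the reference joint for relative positions (assumed).
    triplets:  non-empty sequence of (a, b, c); the angle is measured at joint b.
    Returns an array of shape (T, 2 * (K - 1) + len(triplets)).
    """
    kp = np.asarray(keypoints, dtype=float)
    # Relative positions: every keypoint expressed relative to the root joint.
    rel = kp - kp[:, root:root + 1, :]
    rel = np.delete(rel, root, axis=1).reshape(kp.shape[0], -1)
    # Angles: angle at joint b between the segments b->a and b->c.
    angles = []
    for a, b, c in triplets:
        u = kp[:, a] - kp[:, b]
        v = kp[:, c] - kp[:, b]
        norm = np.linalg.norm(u, axis=1) * np.linalg.norm(v, axis=1) + 1e-8
        cos = np.sum(u * v, axis=1) / norm
        angles.append(np.arccos(np.clip(cos, -1.0, 1.0)))
    return np.concatenate([rel, np.stack(angles, axis=1)], axis=1)

def fuse_scores(scores, weights):
    """Class-wise weighted voting over sub-classifier scores.

    scores:  array of shape (M, C), class scores from M sub-classifiers.
    weights: array of shape (M, C), one weight per sub-classifier and class
             (assumed to be given here; the paper obtains them by training).
    Returns the index of the predicted class.
    """
    scores = np.asarray(scores, dtype=float)
    weights = np.asarray(weights, dtype=float)
    fused = np.sum(weights * scores, axis=0)
    return int(np.argmax(fused))
```

In this sketch, the feature sequence returned by `skeleton_features` would be the input to a time-series model, and `fuse_scores` would combine that model's class scores with those of the video-based models. Giving each sub-classifier a separate weight per class, rather than one scalar weight, lets a model that is reliable on only some gestures dominate the vote for those gestures alone.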