变压器
计算机科学
直觉
运动捕捉
动作识别
RGB颜色模型
人工智能
计算机视觉
运动(物理)
机器学习
模式识别(心理学)
工程类
电压
电气工程
哲学
认识论
班级(哲学)
作者
Magnus Ibh,Stella Graßhof,Dan Witzner,Pascal Madeleine
标识
DOI:10.1109/cvprw59228.2023.00548
摘要
This paper presents TemPose, a novel skeleton-based transformer model designed for fine-grained motion recognition to improve understanding of the detailed player actions in badminton. The model utilizes multiple temporal and interaction layers to capture variable-length multi-person human actions while minimizing reliance on non-human visual context. TemPose is evaluated on two fine-grained badminton datasets, where it significantly outperforms other baseline models by incorporating additional input streams, such as the shuttlecock position, into the temporal transformer layers of the model. Additionally, TemPose demonstrates great versatility by achieving competitive results compared to other state-of-the-art skeleton-based models on the large-scale action recognition benchmark NTU RGB+D. Experiments are conducted to explore how different model parameter configurations affect Tem-Pose's performance. Additionally, a qualitative analysis of the temporal attention maps suggests that the model learns to prioritize frames of specific poses relevant to different actions while formulating an intuition of each individual's importance in the sequences. Overall, TemPose is an intuitive and versatile architecture that has the potential to be further developed and incorporated into other methods for managing human motion in sports with state-of-the-art results.
科研通智能强力驱动
Strongly Powered by AbleSci AI