计算机科学
人工智能
模式识别(心理学)
杠杆(统计)
卷积(计算机科学)
动作识别
骨架(计算机编程)
突出
图形
计算机视觉
特征(语言学)
人体骨骼
特征提取
理论计算机科学
人工神经网络
哲学
程序设计语言
班级(哲学)
语言学
作者
Wenfeng Song,Tangli Chu,Shuai Li,Nannan Li,Aimin Hao,Hong Qin
标识
DOI:10.1109/tmm.2023.3324835
摘要
Skeleton-based action recognition is crucial for natural human-computer interaction, dynamic behavior analysis, and behavior surveillance. The key challenge is to effectively capture the intrinsic local-global clues of the activity. However, it remains challenging to efficiently leverage multidimensional information related to joints' local visual appearances, global spatial relationships, and coherent temporal cues. To address this challenge, we propose a joints-centered spatial-temporal feature-fused framework for action recognition, which exploits skeleton-based graph diffusion and convolution. Specifically, we employ Partial Differential Equation (PDE) based skeleton graph diffusion to automatically activate and diffuse the salient appearance features of joints. This approach simultaneously integrates the joints' appearance clues and their hierarchical relationships at both the super-pixel level and structure level. The diffused appearance-related features of the joints are further fused with skeleton-related spatial-temporal features, and the resulting fused features are fed into a skeleton convolution network for action recognition. Our method was extensively evaluated on two public datasets (NTU-RGBD and UWA3D), and the results demonstrate the improved accuracy and effectiveness of our approach. Our code will be public.
科研通智能强力驱动
Strongly Powered by AbleSci AI