舞蹈
计算机科学
稳健性(进化)
人工智能
情态动词
光学(聚焦)
运动捕捉
机器学习
运动(物理)
人机交互
艺术
生物化学
化学
物理
文学类
高分子化学
光学
基因
作者
Zhong Yun,Fan Zhang,Yiannis Demiris
标识
DOI:10.1109/icassp49357.2023.10096824
摘要
A fundamental challenge of analyzing human motion is to effectively represent human movements both spatially and temporally. We propose a contrastive self-supervised strategy to tackle this challenge. Particularly, we focus on dancing, which involves a high level of physical and intellectual abilities. Firstly, we deploy Graph and Residual Neural Networks with Siamese architecture to represent the dance motion and music features respectively. Secondly, we apply the InfoNCE loss to contrastively embed the high-dimensional multimedia signals onto the latent space without label supervision. Finally, our proposed framework is evaluated on a multi-modal Dance- Music-Level dataset composed of various dance motions, music, genres and choreographies with dancers of different expertise levels. Experimental results demonstrate the robustness and improvements of our proposed method over 3 baselines and 6 ablation studies across tasks of dance genres, choreographies classification and dancer expertise level assessment.
科研通智能强力驱动
Strongly Powered by AbleSci AI