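Computer science
Artificial intelligence
Pattern recognition (psychology)
Action recognition
Action (physics)
Channel (broadcasting)
Speech recognition
Computer network
Physics
Quantum mechanics
Class (philosophy)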
Authors
Baiqiao Zhang,Yanran Yuan,Wei Qin,Xiangxian Li,Weiying Liu,Wenxin Yao,Yulong Bian,Juan Liu
Identifier
DOI:10.1109/jbhi.2024.3511601
Abstract
Stereotyped movements play a crucial role in diagnosing Autism Spectrum Disorder (ASD). However, recognizing them is challenging because of limited data availability, the specificity of the movements, and their varying duration. To support in-depth analysis of the movements of children with ASD, we constructed the ACSA653 dataset, comprising 653 videos across six classes of stereotyped movements. This dataset surpasses existing ones in both scale and number of categories. To improve the recognition of stereotyped movements, we propose APMFNet, a model that integrates three modules: Visual Motion Learning (VML), Skeleton Relation Mining (SRM), and Multi-channel Fusion (MF). The VML module extracts spatial and motion information from RGB and optical-flow sequences. The SRM module mines essential motion patterns associated with stereotyped movements through a cross-modal graph. The MF module fuses multi-modal information through cross-modality attention to facilitate decision-making. Tested on ACSA653, APMFNet outperforms current state-of-the-art methods, suggesting its potential to identify stable patterns of stereotyped movements in children with ASD.
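The abstract does not specify how the MF module implements cross-modality attention, so the following is only a minimal illustrative sketch of the general idea, not APMFNet's method. It assumes plain scaled dot-product attention with no learned projections, in which feature vectors from one modality (here called `visual`) attend over feature vectors from another (here called `skeleton`). The function names, variable names, and toy feature values are all hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries, keys_values):
    """Let each query vector (modality A) attend over modality B.

    Assumption: keys and values are the same vectors and there are no
    learned projection matrices; a trained model would normally have them.
    """
    dim = len(queries[0])
    fused = []
    for q in queries:
        # Scaled dot-product similarity between this query and every key.
        scores = [
            sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(dim)
            for k in keys_values
        ]
        weights = softmax(scores)
        # Attention-weighted sum of the other modality's vectors.
        context = [
            sum(w * v[j] for w, v in zip(weights, keys_values))
            for j in range(dim)
        ]
        # Residual connection: keep the query's own information.
        fused.append([qi + ci for qi, ci in zip(q, context)])
    return fused


# Toy features (hypothetical): 2 visual tokens, 3 skeleton tokens, dim 2.
visual = [[1.0, 0.0], [0.0, 1.0]]
skeleton = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
fused = cross_attention(visual, skeleton)
```

Each row of `fused` is a visual feature enriched with the skeleton features most similar to it; a classifier over the stereotyped-movement classes would then operate on such fused features.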