计算机科学
混合(物理)
面部表情
人工智能
面部表情识别
表达式(计算机科学)
面部识别系统
模式识别(心理学)
语音识别
计算机视觉
物理
量子力学
程序设计语言
作者
Ryosuke Kawamura,H. Hayashi,Noriko Takemura,Hajime Nagahara
标识
DOI:10.1109/wacv57701.2024.00642
摘要
Dynamic facial expression recognition (DFER) is an important task in the field of computer vision. To apply automatic DFER in practice, it is necessary to accurately recognize ambiguous facial expressions, which often appear in data in the wild. In this paper, we propose MIDAS, a data augmentation method for DFER, which augments ambiguous facial expression data with soft labels consisting of probabilities for multiple emotion classes. In MIDAS, the training data are augmented by convexly combining pairs of video frames and their corresponding emotion class labels, which can also be regarded as an extension of mixup to soft-labeled video data. This simple extension is remarkably effective in DFER with ambiguous facial expression data. To evaluate MIDAS, we conducted experiments on the DFEW dataset. The results demonstrate that the model trained on the data augmented by MIDAS outperforms the existing state-of-the-art method trained on the original dataset.
科研通智能强力驱动
Strongly Powered by AbleSci AI