计算机科学
人工智能
情绪识别
情绪分类
面部表情
情感计算
模式识别(心理学)
语音识别
特征提取
作者
Shuai Wang,Wenxuan Wang,Jinming Zhao,Shizhe Chen,Qin Jin,Shilei Zhang,Yong Qin
出处
期刊:International Conference on Multimodal Interfaces
日期:2017-11-03
卷期号:: 598-602
被引量:6
标识
DOI:10.1145/3136755.3143016
摘要
This paper presents our methods to the Audio-Video Based Emotion Recognition subtask in the 2017 Emotion Recognition in the Wild (EmotiW) Challenge. The task aims to predict one of the seven basic emotions for short video segments. We extract different features from audio and facial expression modalities. We also explore the temporal LSTM model with the input of frame facial features, which improves the performance of the non-temporal model. The fusion of different modality features and the temporal model lead us to achieve a 58.5% accuracy on the testing set, which shows the effectiveness of our methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI