计算机科学
面部表情识别
人工智能
图形
面部表情
卷积神经网络
模式识别(心理学)
计算机视觉
语音识别
面部识别系统
理论计算机科学
作者
F. Wang,Zhao Liu,Jie Lei,Zeyu Zou,Wentao Han,Juan Xu,Xuan Li,Zunlei Feng,Ronghua Liang
标识
DOI:10.1007/978-3-031-53308-2_4
摘要
Most of the current methods for video-based facial expression recognition (FER) in the wild are based on deep neural networks with attention mechanism to capture the relationships between frames. However, these methods suffer from the large variations of expression patterns and data uncertainties. This paper proposes a Dynamic-Static Graph Convolutional Network (DSGCN), which mainly consists of a Static-Relational graph (SRG) and a Dynamic-Relational graph (DRG). The SRG aims to guide the network to learn the static spatial relationship of facial expressions in each video frame, strengthening the salient areas of the face through the dependencies of context nodes. The DRG learns the dynamic temporal relationship of facial expressions by aggregating video sequence features, constructing a graph with other samples within a batch to share facial expression features with different contexts, thus promoting feature diversity to improve robustness. The proposed DSGCN framework achieves state-of-the-art results on the FERV39K, DFEW and AFEW benchmarks, and ablation experiments verify the effectiveness of each module.
科研通智能强力驱动
Strongly Powered by AbleSci AI