面部表情
计算机科学
人工智能
特征(语言学)
三维人脸识别
计算机视觉
直方图
模式识别(心理学)
模式
面部识别系统
面子(社会学概念)
表达式(计算机科学)
幻觉
定向梯度直方图
语音识别
人脸检测
图像(数学)
社会科学
哲学
语言学
社会学
程序设计语言
作者
Junkai Chen,Zenghai Chen,Zheru Chi,Hong Fu
标识
DOI:10.1109/taffc.2016.2593719
摘要
Video based facial expression recognition has been a long standing problem and attracted growing attention recently. The key to a successful facial expression recognition system is to exploit the potentials of audiovisual modalities and design robust features to effectively characterize the facial appearance and configuration changes caused by facial motions. We propose an effective framework to address this issue in this paper. In our study, both visual modalities (face images) and audio modalities (speech) are utilized. A new feature descriptor called Histogram of Oriented Gradients from Three Orthogonal Planes (HOG-TOP) is proposed to extract dynamic textures from video sequences to characterize facial appearance changes. And a new effective geometric feature derived from the warp transformation of facial landmarks is proposed to capture facial configuration changes. Moreover, the role of audio modalities on recognition is also explored in our study. We applied the multiple feature fusion to tackle the video-based facial expression recognition problems under lab-controlled environment and in the wild, respectively. Experiments conducted on the extended Cohn-Kanade (CK+) database and the Acted Facial Expression in Wild (AFEW) 4.0 database show that our approach is robust in dealing with video-based facial expression recognition problems under lab-controlled environment and in the wild compared with the other state-of-the-art methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI