A Novel Emotion-Aware Method Based on the Fusion of Textual Description of Speech, Body Movements, and Facial Expressions

计算机科学 面部表情 人工智能 语音识别 保险丝(电气) 人工神经网络 代表(政治) 自然语言处理 模式识别(心理学) 政治学 政治 电气工程 工程类 法学
作者
Guanglong Du,Yuwen Zeng,Kang Su,Chunquan Li,Xueqian Wang,Shaohua Teng,Di Li,Peter Liu
出处
期刊:IEEE Transactions on Instrumentation and Measurement [Institute of Electrical and Electronics Engineers]
卷期号:71: 1-16 被引量:13
标识
DOI:10.1109/tim.2022.3204940
摘要

Emotion computing is a necessary part of advanced human–computer interaction. An appropriate description of a character's facial expressions, body languages, and speaking styles in novels always enables readers to infer the character's emotions. Moreover, multimodal information is complementary and integrated. Fusing the information from multiple modes into a textual modal can get better fusion results and overcome the bias of understanding the unimodal information. Inspired by these facts, we develop a novel emotion-aware method by the fusion of textual description of speech, body movements, and facial expression, which reduces the dimensionality of speech, body movements, and facial expressions by unifying three types of information into a unified component. Specifically, to fuse multimodel features for emotion recognition, we propose a two-stage neural network. First, bidirectional long short-term memory-conditional random fields (Bi-LSTM-CRF) and back-propagation neural network (BPNN) are used to analyze the extracted vocal and visual features of facial expressions, body movements, and speeches, which aims to obtain textual descriptions of different features. Second, the textual descriptions of the features are fused through a neural network with a self-organization map (SOM) layer and are used to compensate layers that are trained by web-based corpus. The advantages of this method are to utilize depth information to track facial and bodily movement, and employ an explainable textual intermediate representation to fuse the features. We experimentally tested the emotion-aware system in real-world applications, and the results indicate that our system can quickly and steadily recognize human emotions. Compared with other unimodal and multimodal-fusion algorithms, our method is more precise, which can improve the accuracy by up to 30% compared with the unimodal method.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
写成完成签到,获得积分10
8秒前
General发布了新的文献求助10
8秒前
Orange应助畔畔采纳,获得200
9秒前
12秒前
调皮的雨灵关注了科研通微信公众号
13秒前
13秒前
15秒前
Aikebaier0127发布了新的文献求助10
15秒前
李健的小迷弟应助先字母采纳,获得10
15秒前
16秒前
16秒前
辛坦夫完成签到,获得积分10
16秒前
16秒前
mmmmmagic完成签到,获得积分10
18秒前
18秒前
Ning发布了新的文献求助10
18秒前
NexusExplorer应助风趣的绿茶采纳,获得10
19秒前
19秒前
xiaolizi发布了新的文献求助10
20秒前
ANDW发布了新的文献求助10
25秒前
27秒前
Aikebaier0127完成签到,获得积分20
27秒前
28秒前
美满的初之完成签到,获得积分10
29秒前
shiyu完成签到,获得积分10
30秒前
www发布了新的文献求助10
31秒前
32秒前
迷人的天抒完成签到,获得积分10
32秒前
32秒前
牧林听风完成签到 ,获得积分10
34秒前
34秒前
suxiang完成签到,获得积分10
34秒前
彭于晏应助Ning采纳,获得10
34秒前
愉快问筠完成签到 ,获得积分10
34秒前
山海应助认真雪曼采纳,获得10
35秒前
35秒前
36秒前
反复发作发布了新的文献求助10
36秒前
ANDW发布了新的文献求助10
37秒前
啦啦啦完成签到,获得积分10
37秒前
高分求助中
(应助此贴封号)【重要!!请各用户(尤其是新用户)详细阅读】【科研通的精品贴汇总】 10000
Development Across Adulthood 1000
Chemistry and Physics of Carbon Volume 18 800
The formation of Australian attitudes towards China, 1918-1941 660
Signals, Systems, and Signal Processing 610
天津市智库成果选编 600
全相对论原子结构与含时波包动力学的理论研究--清华大学 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6450438
求助须知:如何正确求助?哪些是违规求助? 8262759
关于积分的说明 17604210
捐赠科研通 5514621
什么是DOI,文献DOI怎么找? 2903319
邀请新用户注册赠送积分活动 1880372
关于科研通互助平台的介绍 1722090