A Multimodal Driver Emotion Recognition Algorithm Based on the Audio and Video Signals in Internet of Vehicles Platform

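Computer science, Discriminant, Feature (linguistics), Feature extraction, Speech recognition, Artificial intelligence, Philosophy, Linguistics
Authors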
Na Ying,Yinhe Jiang,Chunsheng Guo,Di Zhou,Jian Zhao
Source
Journal: IEEE Internet of Things Journal [Institute of Electrical and Electronics Engineers]
Volume (issue): 11 (22): 35812-35824 Citations: 11
Identifier
DOI:10.1109/jiot.2024.3363176
Abstract

Driving can take up a substantial part of daily life and frequently triggers negative emotions such as anger or anxiety, which adversely affect both driving safety and long-term health. To identify driver emotions, and thereby improve the safety and humanization of intelligent driving, this work explores how to model discriminative emotion features from both speech and facial expressions. Specifically, an attention-based network for facial expression and a lightweight speech emotion network are proposed separately; audio and video features are then combined at the feature level to construct the multimodal driver emotion recognition model. On the audio side, a new feature extractor uses a multi-scale residual structure to extract spectrogram features. On the video side, preprocessing with Local Binary Pattern Histograms (LBPH) yields a set of frame sequences with a fixed-dimensional feature representation. These features are fed into a fine-tuned ResNet18 model to analyze spatial information, and the model is augmented with a temporal attention module and a Gated Recurrent Unit (GRU) to produce a highly discriminative video representation. The paper also proposes an Internet of Vehicles (IoV) platform designed specifically for driver emotion recognition. The platform consists of a sensor layer, a data acquisition and transport layer, a server layer, and a data application layer; it uses sensors to collect multimodal data from drivers, providing data support for the proposed algorithm. The algorithm is evaluated on two multimodal emotion datasets, the Ryerson Audio-Visual Database of Emotional Speech and Song (RAVDESS) and Surrey Audio-Visual Expressed Emotion (SAVEE), using a variety of performance indicators.
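The abstract does not spell out how temporal attention and feature-level fusion are computed. As a rough illustration only, the sketch below pools per-frame feature vectors with softmax attention weights and then fuses the result with an audio vector by concatenation. The function names, the `query` vector (a stand-in for learned attention parameters), the toy numbers, and the choice of concatenation are all assumptions; the GRU and the ResNet18 backbone are omitted.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def temporal_attention_pool(frame_features, query):
    """Weight each frame by softmax(dot(frame, query)) and sum over time.

    `query` is a hypothetical stand-in for a learned attention vector.
    """
    scores = [sum(f * q for f, q in zip(frame, query)) for frame in frame_features]
    weights = softmax(scores)
    dim = len(frame_features[0])
    pooled = [
        sum(w * frame[d] for w, frame in zip(weights, frame_features))
        for d in range(dim)
    ]
    return pooled, weights


def fuse_feature_level(audio_vec, video_vec):
    """Feature-level fusion sketched as plain concatenation (an assumption)."""
    return list(audio_vec) + list(video_vec)


# Toy inputs: three video frames with 2-D features and a 3-D audio vector.
frames = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [0.5, 0.5]
audio_vec = [0.2, 0.4, 0.6]

video_vec, weights = temporal_attention_pool(frames, query)
fused = fuse_feature_level(audio_vec, video_vec)
```

In this toy case the third frame has the largest dot product with `query`, so it receives the largest attention weight, and the fused vector has 3 + 2 = 5 dimensions. A classifier would then operate on `fused`.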
Compared with the baseline methods, the proposed multimodal model achieves state-of-the-art results, with recognition accuracy of 0.93 on RAVDESS and 0.99 on SAVEE. It also reaches precision of 0.93 on RAVDESS and 0.99 on SAVEE, and specificity of 0.99 and 1.00, respectively.
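For readers less familiar with the reported indicators, the sketch below shows one common way to compute accuracy, per-class precision, and per-class specificity from a multiclass confusion matrix, treating each class one-vs-rest. The confusion matrix is made-up toy data, not results from the paper, and whether the paper macro-averages its per-class scores is an assumption here.

```python
def accuracy(confusion):
    """Fraction of samples on the diagonal; confusion[i][j] = true i, predicted j."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[k][k] for k in range(len(confusion)))
    return correct / total


def per_class_metrics(confusion):
    """One-vs-rest precision and specificity for every class."""
    n = len(confusion)
    total = sum(sum(row) for row in confusion)
    results = []
    for k in range(n):
        tp = confusion[k][k]
        fn = sum(confusion[k]) - tp                        # true k, predicted other
        fp = sum(confusion[i][k] for i in range(n)) - tp   # true other, predicted k
        tn = total - tp - fn - fp
        precision = tp / (tp + fp) if tp + fp else 0.0
        specificity = tn / (tn + fp) if tn + fp else 0.0
        results.append({"precision": precision, "specificity": specificity})
    return results


def macro_average(results, key):
    """Unweighted mean of one metric across classes."""
    return sum(r[key] for r in results) / len(results)


# Toy 3-class confusion matrix (hypothetical numbers, 10 samples per class).
toy = [
    [8, 1, 1],
    [0, 9, 1],
    [1, 0, 9],
]

acc = accuracy(toy)
metrics = per_class_metrics(toy)
macro_precision = macro_average(metrics, "precision")
macro_specificity = macro_average(metrics, "specificity")
```

With several classes, true negatives dominate each one-vs-rest split, so specificity tends to sit above accuracy and precision. That is consistent with specificity being the highest of the reported scores.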
