Att-Net: Enhanced emotion recognition system using lightweight self-attention module

计算机科学 卷积神经网络 人工智能 模式识别(心理学) 计算 感知器 深度学习 语音识别 人工神经网络 算法
作者
Mustaqeem Mustaqeem,Soonil Kwon
出处
期刊:Applied Soft Computing [Elsevier BV]
卷期号:102: 107101-107101 被引量:124
标识
DOI:10.1016/j.asoc.2021.107101
摘要

Speech emotion recognition (SER) is an active research field of digital signal processing and plays a crucial role in numerous applications of Human–computer interaction (HCI). Nowadays, the baseline state of the art systems has quite a low accuracy and high computations, which needs upgrading to make it reasonable for real-time industrial uses such as detection of content from speech data. The main intent for low recognition rate and high computational cost is a scarceness of datasets, model configuration, and patterns recognition that is the supreme stimulating work for building a robust SER system. In this study, we address these problems and propose a simple and lightweight deep learning-based self-attention module (SAM) for SER system. The transitional features map is given to SAM, which produces efficiently the channel and spatial axes attention map with insignificant overheads. We use a multi-layer perceptron (MLP) in channel attention to extracting global cues and a special dilated convolutional neural network (CNN) in spatial attention to extract spatial info from input tensor. Moreover, we merge, spatial and channel attention maps to produce a combine attention weights as a self-attention module. We placed SAM in the middle of convolutional and connected layers and trained it in an end-to-end mode. The ablation study and comprehensive experimentations are accompanied over IEMOCAP, RAVDESS, and EMO-DB speech emotion datasets. The proposed SER system shows consistent improvements in overall experiments for all datasets and shows 78.01%, 80.00%, and 93.00% average recall, respectively.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
Ava应助古涛采纳,获得10
刚刚
7yin秦发布了新的文献求助10
1秒前
Orange应助傲娇香氛采纳,获得10
2秒前
在水一方应助寻舟者采纳,获得10
3秒前
汉堡包应助wwt采纳,获得10
4秒前
5秒前
7秒前
阿蒋完成签到,获得积分20
8秒前
zghild完成签到,获得积分10
8秒前
8秒前
11111发布了新的文献求助10
9秒前
9秒前
lsc发布了新的文献求助10
9秒前
冯利发布了新的文献求助10
10秒前
10秒前
22完成签到,获得积分10
10秒前
10秒前
科研通AI6.1应助不倦采纳,获得10
11秒前
Smith发布了新的文献求助10
12秒前
852应助弑月采纳,获得10
13秒前
lucky发布了新的文献求助10
14秒前
tracy发布了新的文献求助10
15秒前
寻舟者发布了新的文献求助10
15秒前
Orange应助潇洒的惋清采纳,获得10
16秒前
领导范儿应助潇洒的惋清采纳,获得10
16秒前
务实山菡完成签到,获得积分10
17秒前
17秒前
徐科研完成签到,获得积分10
17秒前
周雪艳完成签到,获得积分10
17秒前
寻舟者完成签到,获得积分10
21秒前
不倦发布了新的文献求助20
22秒前
Orange应助潇洒的惋清采纳,获得10
22秒前
Ava应助潇洒的惋清采纳,获得10
22秒前
今后应助潇洒的惋清采纳,获得10
22秒前
丘比特应助潇洒的惋清采纳,获得10
23秒前
bkagyin应助潇洒的惋清采纳,获得10
23秒前
斯文败类应助潇洒的惋清采纳,获得10
23秒前
23秒前
科目三应助潇洒的惋清采纳,获得10
23秒前
Jasper应助W_H采纳,获得10
23秒前
高分求助中
Adhesion Science: Principles & Practice 1234
Signals, Systems, and Signal Processing 610
The Resilient Mindset 400
Impact of Storage Orientation and Duration on Prefilled Syringe Performance: Break-Loose and Glide Forces, and Injection Time Across Multiple Time Points 360
Programming for Chemical Engineers Using C, C++, and MATLAB 300
Upland Kenya wild flowers and ferns: a flora of the flowers, ferns, grasses, and sedges of highland Kenya 300
Disturbing the Quiet Life? Competition and CEO Incentives 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6654663
求助须知:如何正确求助?哪些是违规求助? 8407797
关于积分的说明 17977435
捐赠科研通 5851462
什么是DOI,文献DOI怎么找? 2972352
邀请新用户注册赠送积分活动 1948146
关于科研通互助平台的介绍 1869349