计算机科学
光谱图
卷积神经网络
光学(聚焦)
人工智能
情绪识别
语音识别
人工神经网络
时滞神经网络
跨步
模式识别(心理学)
情绪分类
深度学习
卷积(计算机科学)
特征(语言学)
光学
物理
计算机安全
作者
Taiba Majid Wani,Teddy Surya Gunawan,Syed B. Qadri,Hasmah Mansor,Mira Kartiwi,Nanang Ismail
标识
DOI:10.1109/icwt50448.2020.9243622
摘要
An assortment of techniques has been presented in the area of Speech Emotion Recognition (SER), where the main focus is to recognize the silent discriminants and useful features of speech signals. These features undergo the process of classification to recognize the specific emotion of a speaker. In recent times, deep learning techniques have emerged as a breakthrough in speech emotion recognition to detect and classify emotions. In this paper, we have modified a recently developed different network architecture of convolutional neural networks, i.e., Deep Stride Convolutional Neural Networks (DSCNN), by taking a smaller number of convolutional layers to increase the computational speed while still maintaining accuracy. Besides, we trained the state-of-art model of CNN and proposed DSCNN on spectrograms generated from the SAVEE speech emotion dataset. For the evaluation process, four emotions angry, happy, neutral, and sad, were considered. Evaluation results show that the proposed architecture DSCNN, with the prediction accuracy of 87.8%, outperforms CNN with 79.4% accuracy.
科研通智能强力驱动
Strongly Powered by AbleSci AI