计算机科学
特征提取
音频信号
语音识别
信号处理
音频信号处理
频域
小波
时域
领域(数学分析)
模式识别(心理学)
人工智能
倒谱
语音处理
特征(语言学)
信号(编程语言)
音频信号流
Mel倒谱
数字信号处理
语音编码
计算机视觉
数学
数学分析
计算机硬件
语言学
哲学
程序设计语言
作者
Garima Sharma,Kartikeyan Umapathy,Sridhar Krishnan
标识
DOI:10.1016/j.apacoust.2019.107020
摘要
Audio signal processing algorithms generally involves analysis of signal, extracting its properties, predicting its behaviour, recognizing if any pattern is present in the signal, and how a particular signal is correlated to another similar signals. Audio signal includes music, speech and environmental sounds. Over the last few decades, audio signal processing has grown significantly in terms of signal analysis and classification. And it has been proven that solutions of many existing issues can be solved by integrating the modern machine learning (ML) algorithms with the audio signal processing techniques. The performance of any ML algorithm depends on the features on which the training and testing is done. Hence feature extraction is one of the most vital part of a machine learning process. The aim of this study is to summarize the literature of the audio signal processing specially focusing on the feature extraction techniques. In this survey the temporal domain, frequency domain, cepstral domain, wavelet domain and time-frequency domain features are discussed in detail.
科研通智能强力驱动
Strongly Powered by AbleSci AI