混响
计算机科学
稳健性(进化)
麦克风阵列
到达方向
卷积神经网络
语音识别
人工智能
深度学习
噪音(视频)
话筒
模式识别(心理学)
人工神经网络
声学
电信
天线(收音机)
基因
声压
图像(数学)
物理
生物化学
化学
作者
Qinglong Li,Xueliang Zhang,Hao Li
标识
DOI:10.1109/icassp.2018.8461386
摘要
Direction of arrival (DOA) estimation is an important topic in microphone array processing. Conventional methods work well in relatively clean conditions but suffer from noise and reverberation distortions. Recently, deep learning-based methods show the robustness to noise and reverberation. However, the performance is degraded rapidly or even model cannot work when microphone array structure changes. So it has to retrain the model with new data, which is a huge work. In this paper, we propose a supervised learning algorithm for DOA estimation combining convolutional neural network (CNN) and long short term memory (LSTM). Experimental results show that the proposed method can improve the accuracy significantly. In addition, due to an input feature design, the proposed method can adapt to a new microphone array conveniently only use a very small amount of data.
科研通智能强力驱动
Strongly Powered by AbleSci AI