计算机科学
多向性
到达方向
混响
麦克风阵列
人工神经网络
话筒
多层感知器
均方误差
感知器
信号(编程语言)
噪音(视频)
人工智能
模式识别(心理学)
语音识别
算法
数学
声学
方位角
统计
电信
声压
图像(数学)
物理
程序设计语言
天线(收音机)
几何学
作者
Xiong Xiao,Shengkui Zhao,Xionghu Zhong,Douglas L. Jones,Eng Siong Chng,Haizhou Li
出处
期刊:International Conference on Acoustics, Speech, and Signal Processing
日期:2015-04-01
卷期号:: 2814-2818
被引量:266
标识
DOI:10.1109/icassp.2015.7178484
摘要
This paper presents a learning-based approach to the task of direction of arrival estimation (DOA) from microphone array input. Traditional signal processing methods such as the classic least square (LS) method rely on strong assumptions on signal models and accurate estimations of time delay of arrival (TDOA) . They only work well in relatively clean conditions, but suffer from noise and reverberation distortions. In this paper, we propose a learning-based approach that can learn from a large amount of simulated noisy and reverberant microphone array inputs for robust DOA estimation. Specifically, we extract features from the generalised cross correlation (GCC) vectors and use a multilayer perceptron neural network to learn the nonlinear mapping from such features to the DOA. One advantage of the learning based method is that as more and more training data becomes available, the DOA estimation will become more and more accurate. Experimental results on simulated data show that the proposed learning based method produces much better results than the state-of-the-art LS method. The testing results on real data recorded in meeting rooms show improved root-mean-square error (RMSE) compared to the LS method.
科研通智能强力驱动
Strongly Powered by AbleSci AI