Stereo recording
Computer science
Anechoic chamber
Logarithm
Binaural
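Filter (signal processing)
Acoustics
Convolutional neural network
Speech recognition
Perception
Cepstrum
Mel-frequency cepstrum
Artificial intelligence
Pattern recognition (psychology)
Computer vision
Feature extraction
Telecommunications
Mathematics
Physics
Mathematical analysis
Neuroscience
Loudspeaker
Biology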
Authors
Jeramey Tyler, Mei Si, Jonas Braasch
Abstract
This paper proposes an acoustic model for predicting room acoustical characteristics from a running binaural signal. This is accomplished by training a convolutional neural network on the output of a precedence effect model to extract the spatial locations of the direct sound source and its early reflections. The precedence effect model extends and modifies the BICAM algorithm with cepstral analysis [Tyler, J., Si, M., & Braasch, J. Acoust. Soc. Am. 151] and a logarithmic filter. The logarithmic filter takes human perception into account and provides better separation at higher frequencies. A synthetic dataset of binaural signals was generated using anechoic orchestral recordings with added reflections and reverberation. The binaural model generates binaural activity maps from binaural input signals, which are then used to train the convolutional neural network. The ability to predict the traits of a direct sound source and its reflections has applications in academic areas such as perceptual modeling and room acoustical analysis. It can also be applied in industrial areas such as television and movies, video games, and augmented and virtual reality. [Work supported by the National Science Foundation: HCC-1909229.]
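The role of cepstral analysis in locating reflections can be illustrated with a minimal sketch. It is not the BICAM algorithm or the authors' model: it only shows the general property that an added reflection produces a peak in the real cepstrum at the reflection's delay. The single-channel noise signal, the delay and gain values, and the function names `real_cepstrum` and `estimate_reflection_delay` are all assumptions made for illustration.

```python
import numpy as np


def real_cepstrum(x):
    """Real cepstrum: inverse FFT of the log magnitude spectrum."""
    spectrum = np.fft.fft(x)
    # A small offset avoids log(0) for empty frequency bins.
    log_magnitude = np.log(np.abs(spectrum) + 1e-12)
    return np.real(np.fft.ifft(log_magnitude))


def estimate_reflection_delay(signal, min_lag, max_lag):
    """Return the lag (in samples) of the largest cepstral peak in [min_lag, max_lag)."""
    cepstrum = real_cepstrum(signal)
    return min_lag + int(np.argmax(cepstrum[min_lag:max_lag]))


# Hypothetical test signal: white noise stands in for an anechoic recording.
rng = np.random.default_rng(0)
direct = rng.standard_normal(4096)

# Add one synthetic reflection (assumed values: 200-sample delay, gain 0.6).
delay = 200
gain = 0.6
mixed = direct.copy()
mixed[delay:] += gain * direct[:-delay]

# Search a lag range that excludes the low-quefrency region near zero.
estimated = estimate_reflection_delay(mixed, min_lag=50, max_lag=1000)
print("estimated reflection delay (samples):", estimated)
```

In a binaural setting the same idea would be applied per ear or to interaural functions, and the resulting maps, rather than a single lag, would feed the network; those steps are outside this sketch.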