噪音(视频)
语音识别
计算机科学
变换矩阵
转化(遗传学)
适应(眼睛)
话语
插值(计算机图形学)
模式识别(心理学)
人工智能
运动(物理)
心理学
生物化学
化学
物理
运动学
经典力学
图像(数学)
基因
神经科学
作者
A. Wakao,Kazuya Takeda,Fumitada Itakura
标识
DOI:10.1109/icslp.1996.607192
摘要
The variability of Lombard speech under different noise conditions and an adaptation method for the different Lombard speech are discussed. For this purpose, various kinds of Lombard speech are recorded under different conditions of noise injected into a earphone with controlled feedback of voice. First, DTW word recognition experiments using clean speech as a reference are performed to show that the higher the noise level becomes the more seriously the utterance is affected. The second linear transformation of the cepstral feature vector is tested to show that when given enough (more than 100 words) training data, the transformation matrix can be correctly learned for each of the noise conditions. Interpolation of the transfer matrix is then proposed in order to reduce the adaptation parameter and number of training samples. The authors show, finally, that five words are enough for the learning interpolated transformation matrix for unknown noise conditions.
科研通智能强力驱动
Strongly Powered by AbleSci AI