语音活动检测
语音识别
能量(信号处理)
计算机科学
信号(编程语言)
语音增强
噪音(视频)
信噪比(成像)
背景噪声
探测理论
语音处理
人工智能
电信
探测器
数学
统计
图像(数学)
程序设计语言
作者
Ekaterina Verteletskaya,Kirill Sakhnov
摘要
This paper describes a study of noise-robust voice activity detection (VAD) utilizing the periodicity of the signal, full band signal energy and high band to low band signal energy ratio. Conventional VADs are sensitive to a variably noisy environment especially with low SNR, and also result in cutting off unvoiced regions of speech as well as random oscillating of output VAD decisions. To overcome these problems, the proposed algorithm first identifies voiced regions of speech and then differentiates unvoiced regions from silence or background noise using the energy ratio and total signal energy. The performance of the proposed VAD algorithm is tested on real speech signals. Comparisons confirm that the proposed VAD algorithm outperforms the conventional VAD algorithms, especially in the presence of background noise.
科研通智能强力驱动
Strongly Powered by AbleSci AI