希尔伯特-黄变换
噪音(视频)
计算机科学
语音识别
信噪比(成像)
能量(信号处理)
降噪
减法
语音增强
信号(编程语言)
算法
模式识别(心理学)
人工智能
数学
白噪声
统计
电信
图像(数学)
算术
程序设计语言
作者
Jin Wu,Gege Chong,Wenting Pang,Lei Wang
标识
DOI:10.1109/icnlp58431.2023.00029
摘要
Aiming at the problem that the correct rate of speech endpoint detection is low in the environment with low signal-to-noise ratio, a speech endpoint detection algorithm based on Empirical Mode Decomposition (EMD) and improved spectral subtraction is proposed, considering some noise reduction before endpoint detection. After EMD decomposition and reconstruction, the algorithm uses the improved spectral subtraction of multi-window spectral estimation to reduce noise, which improves the signal-to-noise ratio of speech signal, and then detects the endpoint by using the Teager energy and Zero-Crossing Rate(ZCR). The effectiveness and feasibility of the method presented in this paper are verified by the simulation experiment. The speech signals selected in the experiment were recorded in a quiet environment. Compared with the speech endpoint detection algorithm based on empirical modal decomposition and improved two-threshold method, the proposed algorithm has significantly improved the accuracy and accuracy of endpoint detection.
科研通智能强力驱动
Strongly Powered by AbleSci AI