可理解性(哲学)
语音识别
计算机科学
听力图
立体声录音
助听器
卷积神经网络
听力损失
稳健性(进化)
听力学
人工智能
医学
哲学
认识论
生物化学
化学
基因
作者
Candy Olivia Mawalim,Benita Angela Titalim,Shogo Okada,Masashi Unoki
标识
DOI:10.1016/j.apacoust.2023.109663
摘要
Speech intelligibility prediction methods are necessary for hearing aid development. However, many such prediction methods are categorized as intrusive metrics because they require reference speech as input, which is often unavailable in real-world situations. Additionally, the processing techniques in hearing aids may cause temporal or frequency shifts, which degrade the accuracy of intrusive speech intelligibility metrics. This paper proposes a non-intrusive auditory model for predicting speech intelligibility under hearing loss conditions. The proposed method requires binaural signals from hearing aids and audiograms representing the hearing conditions of hearing-impaired listeners. It also includes additional acoustic features to improve the method's robustness in noisy and reverberant environments. A two-dimensional convolutional neural network with neural decision forests is used to construct a speech intelligibility prediction model. An evaluation conducted with the first Clarity Prediction Challenge dataset shows that the proposed method performs better than the baseline system.
科研通智能强力驱动
Strongly Powered by AbleSci AI