Advancements in end-to-end isolated Kannada ASR system by combining robust noise elimination technique and TDNN

计算机科学 语音识别 隐马尔可夫模型 噪音(视频) 卡纳达 人工神经网络 语音活动检测 人工智能 语音处理 图像(数学)
作者
Thimmaraja Yadava G,B. G. Nagaraja,H. S. Jayanna
出处
期刊:Intelligent systems with applications [Elsevier]
卷期号:20: 200288-200288
标识
DOI:10.1016/j.iswa.2023.200288
摘要

The demonstration of the latest enhancements in the end-to-end (E2E) isolated Kannada automatic speech recognition (ASR) system, achieved by combining a robust background noise elimination technique and a time delay neural network (TDNN), is presented in this work. An E2E Kannada ASR system consists of an interactive voice response system (IVRS), ASR models, and databases containing weather and agricultural commodity prices information. In the earlier spoken query system (SQS), the presence of babble, street noise, and other background noises led to a decrease in both offline and online speech recognition accuracies. To properly train the models, we increase the size of the database by collecting the Kannada speech data from an additional 500 farmers under real-time conditions. Moreover, the proposed noise elimination technique is employed to enhance the degraded speech data. Additionally, the efficacy of the TDNN is explored to improve the recognition accuracy of ASR models and the SQS system. The outcomes of the proposed speech enhancement algorithm demonstrate the absence of audible musical noise and other types of background noises in enhanced NOIZEUS and isolated Kannada speech databases. Leveraging Kannada language resources and the amalgamation of the proposed noise reduction technique and TDNN, a significant 1.1% enhancement in speech recognition accuracy is achieved in comparison with the previously developed deep neural network-hidden Markov model (DNN-HMM) based SQS. The enhanced isolated E2E ASR SQS system undergoes testing by 500 farmers, enabling them to access real-time agricultural commodity prices and weather forecasting information in their native Kannada language/dialects. This practical validation highlights the applicability and effectiveness of our advancements in real-world scenarios. The source code of proposed noise elimination technique, experimental results of ASR models and demo conversation of SQS are made publicly available at: https://sites.google.com/view/thimmarajayadavag/downloads
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
刚刚
伶俐世德发布了新的文献求助10
刚刚
李健应助happiness采纳,获得10
刚刚
1秒前
SciGPT应助愉快的雪巧采纳,获得10
1秒前
OUCER发布了新的文献求助10
1秒前
1秒前
Tangbing完成签到 ,获得积分10
2秒前
2秒前
Eating发布了新的文献求助10
3秒前
项惋清完成签到,获得积分10
4秒前
4秒前
左岸心诚发布了新的文献求助20
5秒前
5秒前
可耐的Gamma完成签到,获得积分10
6秒前
子车半烟发布了新的文献求助10
6秒前
小小泽完成签到,获得积分10
7秒前
7秒前
伶俐世德完成签到,获得积分10
7秒前
渣渣灰完成签到,获得积分10
8秒前
Alaskan发布了新的文献求助10
8秒前
8秒前
深绿浅蓝应助cmt采纳,获得10
9秒前
科目三应助Alaskan采纳,获得10
12秒前
13秒前
该死的论文完成签到,获得积分10
13秒前
13秒前
爆米花应助jiafang采纳,获得10
14秒前
隐形曼青应助优秀的素采纳,获得10
14秒前
研友_VZG7GZ应助谦让的樱采纳,获得10
14秒前
张家辉是卧底完成签到 ,获得积分10
14秒前
14秒前
深情安青应助青易采纳,获得10
16秒前
年轻的元菱应助机灵柚子采纳,获得10
16秒前
NexusExplorer应助castro采纳,获得10
17秒前
17秒前
18秒前
张铃仪发布了新的文献求助10
18秒前
juice完成签到,获得积分10
19秒前
ysq发布了新的文献求助10
19秒前
高分求助中
Manual of Clinical Microbiology, 4 Volume Set (ASM Books) 13th Edition 1000
Sport in der Antike 800
De arte gymnastica. The art of gymnastics 600
Berns Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500
Stephen R. Mackinnon - Chen Hansheng: China’s Last Romantic Revolutionary (2023) 500
Sport in der Antike Hardcover – March 1, 2015 500
Boris Pesce - Gli impiegati della Fiat dal 1955 al 1999 un percorso nella memoria 500
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2422058
求助须知:如何正确求助?哪些是违规求助? 2111559
关于积分的说明 5345491
捐赠科研通 1839069
什么是DOI,文献DOI怎么找? 915501
版权声明 561201
科研通“疑难数据库(出版商)”最低求助积分说明 489590