Model Generation of Accented Speech using Model Transformation and Verification for Bilingual Speech Recognition

计算机科学 语音识别 转化(遗传学) 人工智能 参数统计 隐马尔可夫模型 声学模型 压力(语言学) 自然语言处理 语音处理 数学 生物化学 基因 统计 化学
作者
Han-Ping Shen,Chung-Hsien Wu,Pei Shan Tsai
标识
DOI:10.1145/2661637
摘要

Nowadays, bilingual or multilingual speech recognition is confronted with the accent-related problem caused by non-native speech in a variety of real-world applications. Accent modeling of non-native speech is definitely challenging, because the acoustic properties in highly-accented speech pronounced by non-native speakers are quite divergent. The aim of this study is to generate highly Mandarin-accented English models for speakers whose mother tongue is Mandarin. First, a two-stage, state-based verification method is proposed to extract the state-level, highly-accented speech segments automatically. Acoustic features and articulatory features are successively used for robust verification of the extracted speech segments. Second, Gaussian components of the highly-accented speech models are generated from the corresponding Gaussian components of the native speech models using a linear transformation function. A decision tree is constructed to categorize the transformation functions and used for transformation function retrieval to deal with the data sparseness problem. Third, a discrimination function is further applied to verify the generated accented acoustic models. Finally, the successfully verified accented English models are integrated into the native bilingual phone model set for Mandarin-English bilingual speech recognition. Experimental results show that the proposed approach can effectively alleviate recognition performance degradation due to accents and can obtain absolute improvements of 4.1%, 1.8%, and 2.7% in word accuracy for bilingual speech recognition compared to that using traditional ASR approaches, MAP-adapted, and MLLR-adapted ASR methods, respectively.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
更新
大幅提高文件上传限制,最高150M (2024-4-1)

科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
yy完成签到 ,获得积分10
1秒前
1秒前
跟屁虫完成签到,获得积分10
1秒前
隐形曼青应助even采纳,获得10
1秒前
2秒前
3秒前
德濬完成签到,获得积分10
4秒前
ounceee发布了新的文献求助10
5秒前
zxy发布了新的文献求助10
5秒前
6秒前
洋葱头Yuu完成签到 ,获得积分10
7秒前
8秒前
英俊的铭应助等等采纳,获得10
9秒前
10秒前
11秒前
花开hhhhhhh完成签到,获得积分10
11秒前
小作坊钳工完成签到,获得积分10
13秒前
啊嚯发布了新的文献求助10
14秒前
even发布了新的文献求助10
14秒前
啦啦啦发布了新的文献求助10
14秒前
汉堡包应助tyfelix采纳,获得10
16秒前
研友_VZG7GZ应助啊嚯采纳,获得10
18秒前
摸鱼摸鱼摸摸鱼完成签到,获得积分10
20秒前
爱鱼人士应助Ann采纳,获得10
23秒前
介电发nature完成签到,获得积分20
23秒前
25秒前
28秒前
ding应助啦啦啦采纳,获得10
29秒前
30秒前
SOLOMON应助科研通管家采纳,获得10
32秒前
研友_VZG7GZ应助科研通管家采纳,获得10
32秒前
大模型应助科研通管家采纳,获得10
33秒前
丘比特应助科研通管家采纳,获得10
33秒前
领导范儿应助科研通管家采纳,获得10
33秒前
英姑应助科研通管家采纳,获得10
33秒前
天天快乐应助科研通管家采纳,获得10
33秒前
33秒前
CodeCraft应助科研通管家采纳,获得10
33秒前
在水一方应助科研通管家采纳,获得10
33秒前
shinysparrow应助科研通管家采纳,获得30
33秒前
高分求助中
The three stars each: the Astrolabes and related texts 1100
Berns Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500
Stephen R. Mackinnon - Chen Hansheng: China’s Last Romantic Revolutionary (2023) 500
Psychological Warfare Operations at Lower Echelons in the Eighth Army, July 1952 – July 1953 400
宋、元、明、清时期“把/将”字句研究 300
Julia Lovell - Maoism: a global history 300
Classroom Discourse Competence 260
热门求助领域 (近24小时)
化学 材料科学 医学 生物 有机化学 工程类 生物化学 纳米技术 物理 内科学 计算机科学 化学工程 复合材料 遗传学 基因 物理化学 催化作用 电极 光电子学 量子力学
热门帖子
关注 科研通微信公众号,转发送积分 2432776
求助须知:如何正确求助?哪些是违规求助? 2115334
关于积分的说明 5365679
捐赠科研通 1843389
什么是DOI,文献DOI怎么找? 917359
版权声明 561559
科研通“疑难数据库(出版商)”最低求助积分说明 490718