Recognition and real time performances of a lightweight ultrasound based silent speech interface employing a language model

计算机科学 语音识别 接口(物质) 语言模型 自然语言处理 人工智能 最大气泡压力法 气泡 并行计算
作者
Jun Cai,Bruce Denby,Pierre Roussel,Gérard Dreyfus,Lise Crevier‐Buchman
标识
DOI:10.21437/interspeech.2011-410
摘要

Abstract The work presents advances in the implementation of an ultrasound based silent speech interface system. Use of a portable acquisition device, a visual speech recognizer system with a language model, and real time tests with the Julius system are described. Experiments with two types of visual feature extraction are also presented. Results show that good recognition and real time performance can be obtained with a portable silent speech interface employing a language model. Index Terms : silent speech interface, visual speech recognition, vocal tract imaging, ultrasound imaging 1. Introduction A silent speech interface (SSI) is intended to enable speech communication in the absence of an intelligible acoustic signal [1]. Several experimental SSI systems have been developed using a variety of different sensors [1]. The REVOIX project at the Sigma Laboratory in Paris is building an SSI meant to restore the voices of speech-impaired individuals in real-time. The technique chosen for REVOIX is to drive a recognizer-synthesizer system using ultrasound and video images of the tongue and lips. The REVOIX SSI thus consists of three modules operating sequentially: (1) an acquisition module to record simultaneous ultrasound and visual images of the vocal tract; (2) a word-level visual speech recognizer that uses Hidden Markov Models trained on features extracted from these images (HTK toolkit [7]), rather than from acoustic features; and (3) a speech synthesizer. To be genuinely useful, such a device will ultimately have to be lightweight, have good recognition and synthesis performance, and operate in real time. In this report, we build upon the groundwork laid in earlier research [2-6] by:  Introducing a new, portable acquisition system;  Comparing different types of visual feature extraction;  Introducing the use of a language model to improve the recognition accuracy;  Experimenting with a real time implementation of the recognition using the Julius system. Our results show that it is possible to obtain good recognition and real time performance using a portable SSI system employing a language model. The visual speech acquisition system and the acquired corpora are described in Section 2 and 3. In Section 4, two visual speech feature extraction techniques, namely the EigenTongues/EigenLips and the Discrete Cosine Transform (DCT), are presented. The experimental results are given in Section 5. Conclusions are drawn in Section 6.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
机灵萝完成签到 ,获得积分10
1秒前
4秒前
Jaylou完成签到,获得积分10
5秒前
英俊的铭应助热情路人采纳,获得10
11秒前
机灵萝关注了科研通微信公众号
11秒前
英姑应助Aurora采纳,获得10
11秒前
阿良完成签到 ,获得积分10
13秒前
wad1314完成签到,获得积分10
13秒前
PMX发布了新的文献求助10
13秒前
迷路的手机完成签到,获得积分10
14秒前
Vesper完成签到 ,获得积分10
16秒前
淡定水杯完成签到,获得积分10
17秒前
och3完成签到,获得积分10
17秒前
星辰大海应助Rongli采纳,获得10
19秒前
where完成签到,获得积分10
21秒前
22秒前
清图完成签到,获得积分10
25秒前
26秒前
dwalll关注了科研通微信公众号
29秒前
PMX完成签到,获得积分20
29秒前
30秒前
31秒前
35秒前
s2183622完成签到,获得积分10
35秒前
37秒前
37秒前
默默琳完成签到,获得积分10
38秒前
Wian发布了新的文献求助10
39秒前
尔信完成签到 ,获得积分10
39秒前
icel完成签到,获得积分10
39秒前
Akim应助科研通管家采纳,获得10
40秒前
科研助手6应助科研通管家采纳,获得10
40秒前
orixero应助科研通管家采纳,获得10
40秒前
科研助手6应助科研通管家采纳,获得10
40秒前
NexusExplorer应助科研通管家采纳,获得10
40秒前
40秒前
英俊的铭应助科研通管家采纳,获得10
40秒前
李健应助PMX采纳,获得10
40秒前
动漫大师发布了新的文献求助10
42秒前
黄可以完成签到,获得积分10
42秒前
高分求助中
【此为提示信息,请勿应助】请按要求发布求助,避免被关 20000
Continuum Thermodynamics and Material Modelling 2000
Encyclopedia of Geology (2nd Edition) 2000
105th Edition CRC Handbook of Chemistry and Physics 1600
Maneuvering of a Damaged Navy Combatant 650
Mixing the elements of mass customisation 300
the MD Anderson Surgical Oncology Manual, Seventh Edition 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3778011
求助须知:如何正确求助?哪些是违规求助? 3323664
关于积分的说明 10215380
捐赠科研通 3038867
什么是DOI,文献DOI怎么找? 1667677
邀请新用户注册赠送积分活动 798341
科研通“疑难数据库(出版商)”最低求助积分说明 758339