亲爱的研友该休息了!由于当前在线用户较少,发布求助请尽量完整的填写文献信息,科研通机器人24小时在线,伴您度过漫漫科研夜!身体可是革命的本钱,早点休息,好梦!

Visual Facial Enhancements Can Significantly Improve Speech Perception in the Presence of Noise

噪音(视频) 感知 计算机科学 语音识别 可视化 计算机视觉 人工智能 人机交互 心理学 图像(数学) 神经科学
作者
Zubin Choudhary,Gerd Bruder,Greg Welch
出处
期刊:IEEE Transactions on Visualization and Computer Graphics [Institute of Electrical and Electronics Engineers]
卷期号:29 (11): 4751-4760 被引量:1
标识
DOI:10.1109/tvcg.2023.3320247
摘要

Human speech perception is generally optimal in quiet environments, however it becomes more difficult and error prone in the presence of noise, such as other humans speaking nearby or ambient noise. In such situations, human speech perception is improved by speech reading, i.e., watching the movements of a speaker's mouth and face, either consciously as done by people with hearing loss or subconsciously by other humans. While previous work focused largely on speech perception of two-dimensional videos of faces, there is a gap in the research field focusing on facial features as seen in head-mounted displays, including the impacts of display resolution, and the effectiveness of visually enhancing a virtual human face on speech perception in the presence of noise. In this paper, we present a comparative user study ( N=21) in which we investigated an audio-only condition compared to two levels of head-mounted display resolution ( 1832×1920 or 916×960 pixels per eye) and two levels of the native or visually enhanced appearance of a virtual human, the latter consisting of an up-scaled facial representation and simulated lipstick (lip coloring) added to increase contrast. To understand effects on speech perception in noise, we measured participants' speech reception thresholds (SRTs) for each audio-visual stimulus condition. These thresholds indicate the decibel levels of the speech signal that are necessary for a listener to receive the speech correctly 50% of the time. First, we show that the display resolution significantly affected participants' ability to perceive the speech signal in noise, which has practical implications for the field, especially in social virtual environments. Second, we show that our visual enhancement method was able to compensate for limited display resolution and was generally preferred by participants. Specifically, our participants indicated that they benefited from the head scaling more than the added facial contrast from the simulated lipstick. We discuss relationships, implications, and guidelines for applications that aim to leverage such enhancements.
最长约 10秒,即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
zzgpku完成签到,获得积分0
17秒前
17秒前
loen完成签到,获得积分10
32秒前
追寻绮玉完成签到,获得积分10
46秒前
深情安青应助yyg采纳,获得30
1分钟前
1分钟前
1分钟前
yyg发布了新的文献求助30
1分钟前
楠茸完成签到 ,获得积分10
1分钟前
2分钟前
闪闪的谷梦完成签到 ,获得积分10
2分钟前
我是老大应助yyg采纳,获得10
2分钟前
feiCheung完成签到 ,获得积分10
2分钟前
二二二发布了新的文献求助50
2分钟前
3分钟前
3分钟前
3分钟前
12345完成签到 ,获得积分20
4分钟前
4分钟前
开朗硬币发布了新的文献求助10
4分钟前
Worenxian完成签到,获得积分10
4分钟前
4分钟前
lihongjie发布了新的文献求助10
4分钟前
5分钟前
5分钟前
earthai完成签到,获得积分10
5分钟前
自然之水完成签到,获得积分10
5分钟前
易水寒完成签到 ,获得积分10
5分钟前
芝麻汤圆完成签到,获得积分10
5分钟前
彭于晏应助忧虑的孤萍采纳,获得10
5分钟前
二二二完成签到,获得积分10
5分钟前
在水一方应助科研通管家采纳,获得10
5分钟前
5分钟前
GeoEye发布了新的文献求助30
6分钟前
ki完成签到 ,获得积分10
6分钟前
高数数完成签到 ,获得积分10
7分钟前
12345关注了科研通微信公众号
7分钟前
充电宝应助科研通管家采纳,获得10
7分钟前
mmyhn应助科研通管家采纳,获得10
7分钟前
依然灬聆听完成签到,获得积分10
7分钟前
高分求助中
【此为提示信息,请勿应助】请按要求发布求助,避免被关 20000
Continuum Thermodynamics and Material Modelling 2000
Encyclopedia of Geology (2nd Edition) 2000
105th Edition CRC Handbook of Chemistry and Physics 1600
Maneuvering of a Damaged Navy Combatant 650
Периодизация спортивной тренировки. Общая теория и её практическое применение 310
Mixing the elements of mass customisation 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 工程类 有机化学 物理 生物化学 纳米技术 计算机科学 化学工程 内科学 复合材料 物理化学 电极 遗传学 量子力学 基因 冶金 催化作用
热门帖子
关注 科研通微信公众号,转发送积分 3779106
求助须知:如何正确求助?哪些是违规求助? 3324745
关于积分的说明 10219794
捐赠科研通 3039837
什么是DOI,文献DOI怎么找? 1668452
邀请新用户注册赠送积分活动 798658
科研通“疑难数据库(出版商)”最低求助积分说明 758503