立体声录音
计算机科学
仿人机器人
背景(考古学)
声源定位
过程(计算)
方位角
机器人
人工智能
人工神经网络
特征(语言学)
声音定位
语音识别
声音(地理)
声学
古生物学
语言学
哲学
物理
天文
生物
操作系统
作者
Karim Youssef,Sylvain Argentieri,Jean‐Luc Zarader
标识
DOI:10.1109/iros.2013.6696771
摘要
Sound source localization is an important feature designed and implemented on robots and intelligent systems. Like other artificial audition tasks, it is constrained to multiple problems, notably sound reflections and noises. This paper presents a sound source azimuth estimation approach in reverberant environments. It exploits binaural signals in a humanoid robotic context. Interaural Time and Level Differences (ITD and ILD) are extracted on multiple frequency bands and combined with a neural network-based learning scheme. A cue filtering process is used to reduce the reverberations effects. The system has been evaluated with simulation and real data, in multiple aspects covering realistic robot operating conditions, and was proven satisfying and effective as will be shown and discussed in the paper.
科研通智能强力驱动
Strongly Powered by AbleSci AI