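Computer science
Artificial intelligence
Audio signal processing
Head-related transfer function
Transfer function
Headphones
Active listening
Equalization (audio)
Speech recognition
Transfer of learning
Computer vision
Process (computing)
Audio signal
Channel (broadcasting)
Acoustics
Engineering
Computer network
Physics
Speech coding
Stereophonic recording
Communication
Sociology
Electrical engineering
Operating system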
Authors
Nikhil Javeri, Prabal Bijoy Dutta, Kaushik Sunder, Kapil Jain
Source
Venue: 2023 Immersive and 3D Audio: from Architecture to Automotive (I3DA)
Date: 2023-09-05
Volume/issue: 1-9
Identifiers
DOI: 10.1109/i3da57090.2023.10289448
Abstract
In the realm of extended reality (XR) applications, personalized Head-Related Transfer Functions (p-HRTFs) have emerged as a critical element for achieving exceptional spatial audio quality. Nevertheless, the process of acquiring personalized HRTFs is intricate and time-intensive. This paper introduces an innovative technique that predicts personalized HRTFs using 2D images or video captures. The proposed method encompasses several key components, including 3D ear reconstruction from 2D images or video, followed by HRTF estimation through Boundary Element Methods or HRTF prediction using Neural Networks. Furthermore, a novel approach for estimating Personalized Headphone Equalization (p-HPEQ) curves, leveraging optical data of the ear, is presented. This approach enhances the accessibility and convenience of personalized spatial audio, leading to tailored listening experiences through customized headphone audio. Objective and subjective experiments are conducted to validate the accuracy of both p-HRTFs and p-HPEQs.