编码器
计算机科学
自编码
反褶积
波束赋形
人工智能
嵌入
职位(财务)
模式识别(心理学)
深度学习
算法
电信
财务
操作系统
经济
作者
H JIA,Feiran Yang,Xiaoqing Hu,Jun Yang
摘要
The deconvolution approach has become a standard method for high-resolution acoustic source mapping, but it suffers from a heavy computational burden. Deep learning-based methods have shown promising progress but often rely on single-type input features and ignore the position- and frequency-dependent variabilities of the point spread function (PSF), which leads to a decline in localization accuracy. This paper proposes a supervised learning framework based on dual-encoder U-net architecture to convert beamforming maps into a high-resolution map of true source strength distribution. Specifically, the model employs two individual encoders to extract complementary features from delay-and-sum and functional beamforming maps. Because the two maps provide distinct information on the same source strength distribution, a contrastive loss function is introduced to help encoders learn consistent latent features of sources. To characterize the PSF variations, a frequency encoder and position encoder are designed to embed prior knowledge, i.e., source frequency and grid positions, into the backbone network. The proposed model outperforms competing methods, on average, across four metrics for the simulation data and MIRACLE dataset and generalizes well across different numbers of sound sources and frequencies.
科研通智能强力驱动
Strongly Powered by AbleSci AI