残差神经网络
计算机科学
水准点(测量)
卷积神经网络
语音识别
说话人识别
人工智能
说话人识别
深度学习
鉴定(生物学)
人工神经网络
深层神经网络
模式识别(心理学)
植物
大地测量学
生物
地理
作者
Maroš Jakubec,Eva Lieskovská,Roman Jarina
标识
DOI:10.1109/radioelektronika52220.2021.9420202
摘要
Various deep neural network (DNN) topologies have been recently introduced for automatic speaker recognition (SR) tasks, which tend to have ever deeper architectures. Several specific architectures have been proposed to tackle the well-known vanishing gradient issue in DNN training. We present results of experiments on text-independent SR in the wild with two deep Convolutional DNN architectures: ResNet and VGG. The results on the VoxCeleb1 benchmark database demonstrate the superiority of the ResNet solutions, in comparison with VGG and standard i-vector approaches, for speaker identification and verification in realistic acoustic conditions.
科研通智能强力驱动
Strongly Powered by AbleSci AI