语音识别
计算机科学
可靠性(半导体)
说话人识别
安全性令牌
模式识别(心理学)
声学
人工智能
物理
功率(物理)
计算机安全
量子力学
作者
Ewald Enzinger,Cuiling Zhang
摘要
For features to be effective in forensic voice comparison, they must have relatively low within-speaker variability and relatively high between-speaker variability. An understudied source of features, which potentially meets these criteria is the acoustic spectrum of nasals. Nasals spectra contain poles and zeros dependent upon nasal cavities. The latter are complex static structures which vary from person to person. Theoretically, nasal spectra may therefore have low within-speaker and high between-speaker variabilities. This study evaluates different methods for extracting spectral features (e.g., pole-zero models, all-pole models, and cepstra) and using them as part of a likelihood-ratio forensic-voice-comparison system. The validity and reliability of each system is empirically evaluated using /m/ and /n/ token extracted from a database of voice recordings of 60 female speakers of Chinese.
科研通智能强力驱动
Strongly Powered by AbleSci AI