
Paralinguistic singing attribute recognition using supervised machine learning for describing the classical tenor solo singing voice in vocal pedagogy

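Paralanguage, Singing, Computer science, Speech recognition, Support vector machine, Artificial intelligence, Feature (linguistics), Communication, Linguistics, Psychology, Acoustics, Philosophy, Physics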
Authors
Yanze Xu, Weiqing Wang, Huahua Cui, Mingyang Xu, Ming Li
Source
Journal: EURASIP Journal on Audio, Speech, and Music Processing [Springer Nature]
Volume/Issue: 2022 (1) Citations: 2
Identifiers
DOI:10.1186/s13636-022-00240-z
Abstract

Humans can recognize someone's identity through their voice and describe the timbral phenomena of voices. The singing voice has timbral phenomena as well. In vocal pedagogy, vocal teachers listen to their students' singing voices and then describe those timbral phenomena. In this study, to enable machines to describe the singing voice from the vocal pedagogy point of view, we perform a task called paralinguistic singing attribute recognition. To achieve this goal, we first construct and publish an open-source dataset named Singing Voice Quality and Technique Database (SVQTD) for supervised learning. All the audio clips in SVQTD are downloaded from YouTube and processed by music source separation and silence detection. For annotation, seven paralinguistic singing attributes commonly used in vocal pedagogy are adopted as the labels. Furthermore, to explore different supervised machine learning algorithms for classifying each paralinguistic singing attribute, we adopt three main frameworks, namely openSMILE features with support vector machine (SF-SVM), end-to-end deep learning (E2EDL), and deep embedding with support vector machine (DE-SVM). Our methods are based on existing frameworks commonly employed in other paralinguistic speech attribute recognition tasks. In SF-SVM, we separately use the feature set of the INTERSPEECH 2009 Challenge and that of the INTERSPEECH 2016 Challenge as the SVM classifier's input. In E2EDL, the end-to-end framework separately utilizes the ResNet and the transformer encoder as feature extractors. In particular, to handle two-dimensional spectrogram input for a transformer, we adopt a sliced multi-head self-attention (SMSA) mechanism. In DE-SVM, we use the representation extracted from the E2EDL model as the input of the SVM classifier.
Experimental results on SVQTD show no absolute winner between E2EDL and DE-SVM, which means that a back-end SVM classifier taking the representation learned by the end-to-end model as input does not necessarily improve performance. However, the DE-SVM that utilizes the ResNet as the feature extractor achieves the best average UAR, with an average 16% improvement over that of the SF-SVM with INTERSPEECH's hand-crafted feature sets.
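
The abstract reports results as UAR without expanding the term. In paralinguistic recognition tasks, UAR usually stands for unweighted average recall: the plain mean of the per-class recalls, so that a rare class counts as much as a frequent one. The sketch below is only an illustration of that metric under this assumption; the function name and the toy labels are hypothetical and are not taken from the paper or from SVQTD.

```python
from collections import defaultdict


def unweighted_average_recall(y_true, y_pred):
    """Return the mean of per-class recalls (each class weighted equally)."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    if not y_true:
        raise ValueError("at least one sample is required")

    total = defaultdict(int)    # number of samples per true class
    correct = defaultdict(int)  # correctly predicted samples per true class
    for truth, pred in zip(y_true, y_pred):
        total[truth] += 1
        if truth == pred:
            correct[truth] += 1

    # Recall is computed only over classes that occur in y_true.
    recalls = [correct[c] / total[c] for c in total]
    return sum(recalls) / len(recalls)


# Hypothetical, imbalanced toy labels (not real annotations).
y_true = ["present"] * 8 + ["absent"] * 2
y_pred = ["present"] * 8 + ["present", "absent"]

uar = unweighted_average_recall(y_true, y_pred)
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"UAR = {uar:.2f}, accuracy = {accuracy:.2f}")
```

On this toy input the majority class is fully recalled and the minority class only half recalled, so UAR is lower than plain accuracy. That gap is why UAR tends to be preferred when attribute labels are imbalanced.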
