Paralinguistic singing attribute recognition using supervised machine learning for describing the classical tenor solo singing voice in vocal pedagogy

副语言 唱歌 计算机科学 语音识别 支持向量机 人工智能 特征(语言学) 沟通 语言学 心理学 声学 哲学 物理
作者
Yanze Xu,Weiqing Wang,Huahua Cui,Mingyang Xu,Ming Li
出处
期刊:Eurasip Journal on Audio, Speech, and Music Processing [Springer Nature]
卷期号:2022 (1) 被引量:2
标识
DOI:10.1186/s13636-022-00240-z
摘要

Humans can recognize someone's identity through their voice and describe the timbral phenomena of voices. Likewise, the singing voice also has timbral phenomena. In vocal pedagogy, vocal teachers listen and then describe the timbral phenomena of their student's singing voice. In this study, in order to enable machines to describe the singing voice from the vocal pedagogy point of view, we perform a task called paralinguistic singing attribute recognition. To achieve this goal, we first construct and publish an open source dataset named Singing Voice Quality and Technique Database (SVQTD) for supervised learning. All the audio clips in SVQTD are downloaded from YouTube and processed by music source separation and silence detection. For annotation, seven paralinguistic singing attributes commonly used in vocal pedagogy are adopted as the labels. Furthermore, to explore the different supervised machine learning algorithm for classifying each paralinguistic singing attribute, we adopt three main frameworks, namely openSMILE features with support vector machine (SF-SVM), end-to-end deep learning (E2EDL), and deep embedding with support vector machine (DE-SVM). Our methods are based on existing frameworks commonly employed in other paralinguistic speech attribute recognition tasks. In SF-SVM, we separately use the feature set of the INTERSPEECH 2009 Challenge and that of the INTERSPEECH 2016 Challenge as the SVM classifier's input. In E2EDL, the end-to-end framework separately utilizes the ResNet and transformer encoder as feature extractors. In particular, to handle two-dimensional spectrogram input for a transformer, we adopt a sliced multi-head self-attention (SMSA) mechanism. In the DE-SVM, we use the representation extracted from the E2EDL model as the input of the SVM classifier. Experimental results on SVQTD show no absolute winner between E2EDL and the DE-SVM, which means that the back-end SVM classifier with the representation learned by E2E as input does not necessarily improve the performance. However, the DE-SVM that utilizes the ResNet as the feature extractor achieves the best average UAR, with an average 16% improvement over that of the SF-SVM with INTERSPEECH's hand-crafted feature set.

科研通智能强力驱动
Strongly Powered by AbleSci AI
科研通是完全免费的文献互助平台,具备全网最快的应助速度,最高的求助完成率。 对每一个文献求助,科研通都将尽心尽力,给求助人一个满意的交代。
实时播报
whiskyzz完成签到,获得积分10
1秒前
Mermaid发布了新的文献求助20
1秒前
Zongxin完成签到,获得积分10
1秒前
1秒前
呆萌的莲完成签到,获得积分10
1秒前
勇yi完成签到,获得积分10
1秒前
建浩发布了新的文献求助10
1秒前
格格巫完成签到,获得积分10
2秒前
lxrong完成签到,获得积分10
2秒前
danrushui777完成签到,获得积分10
2秒前
3秒前
3秒前
牛马完成签到 ,获得积分10
3秒前
LlLly完成签到 ,获得积分10
3秒前
4秒前
xxcing关注了科研通微信公众号
4秒前
4秒前
kkk完成签到 ,获得积分10
5秒前
5秒前
共享精神应助桐桐采纳,获得10
5秒前
5秒前
赘婿应助喜悦的听白采纳,获得10
6秒前
科研通AI6.4应助Kondo采纳,获得10
6秒前
程志强完成签到 ,获得积分10
6秒前
长情正豪完成签到,获得积分10
6秒前
lllllll完成签到,获得积分10
6秒前
潜龙发布了新的文献求助10
6秒前
7秒前
cg666完成签到 ,获得积分10
8秒前
小西瓜发布了新的文献求助10
8秒前
赘婿应助麦兜采纳,获得10
8秒前
kieler完成签到,获得积分10
9秒前
77发布了新的文献求助10
9秒前
路灯下的小伙完成签到,获得积分10
9秒前
9秒前
10秒前
10秒前
11秒前
所所应助小虚心采纳,获得30
11秒前
GOAT发布了新的文献求助50
11秒前
高分求助中
Adhesion Science: Principles & Practice 1234
Signals, Systems, and Signal Processing 610
Burger's Medicinal Chemistry and Drug Discovery 400
A Step-by-Step Guide to Qualitative Data Coding 2nd Edition 400
Impact of Storage Orientation and Duration on Prefilled Syringe Performance: Break-Loose and Glide Forces, and Injection Time Across Multiple Time Points 360
Programming for Chemical Engineers Using C, C++, and MATLAB 300
Upland Kenya wild flowers and ferns: a flora of the flowers, ferns, grasses, and sedges of highland Kenya 300
热门求助领域 (近24小时)
化学 材料科学 医学 生物 纳米技术 工程类 有机化学 化学工程 生物化学 计算机科学 物理 内科学 复合材料 催化作用 物理化学 光电子学 电极 细胞生物学 基因 无机化学
热门帖子
关注 科研通微信公众号,转发送积分 6665669
求助须知:如何正确求助?哪些是违规求助? 8415204
关于积分的说明 17989207
捐赠科研通 5871581
什么是DOI,文献DOI怎么找? 2975796
邀请新用户注册赠送积分活动 1951705
关于科研通互助平台的介绍 1878614