发布文献求助

SPICE: Self-Supervised Pitch Estimation

计算机科学编码器任务（项目管理）语音识别人工智能基本事实钥匙（锁）信号（编程语言）监督学习基音检测算法模式识别（心理学）语音处理人工神经网络操作系统计算机安全经济管理程序设计语言

作者

Beat Gfeller,Christian Frank,Dominik Roblek,Matt Sharifi,Marco Tagliasacchi,Mihajlo Velimirović

出处

期刊：IEEE/ACM transactions on audio, speech, and language processing [Institute of Electrical and Electronics Engineers]
日期：2020-01-01 卷期号：28: 1118-1128 被引量：33

链接

ieee.org arxiv.org arxiv.org datacite.orgdoi.org

标识

DOI：10.1109/taslp.2020.2982285

摘要

We propose a model to estimate the fundamental frequency in monophonic audio, often referred to as pitch estimation. We acknowledge the fact that obtaining ground truth annotations at the required temporal and frequency resolution is a particularly daunting task. Therefore, we propose to adopt a self-supervised learning technique, which is able to estimate pitch without any form of supervision. The key observation is that pitch shift maps to a simple translation when the audio signal is analysed through the lens of the constant-Q transform (CQT). We design a self-supervised task by feeding two shifted slices of the CQT to the same convolutional encoder, and require that the difference in the outputs is proportional to the corresponding difference in pitch. In addition, we introduce a small model head on top of the encoder, which is able to determine the confidence of the pitch estimate, so as to distinguish between voiced and unvoiced audio. Our results show that the proposed method is able to estimate pitch at a level of accuracy comparable to fully supervised models, both on clean and noisy audio samples, although it does not require access to large labeled datasets.

求助该文献

最长约 10秒，即可获得该文献文件

科研通智能强力驱动
Strongly Powered by AbleSci AI

我的文献求助列表浏览历史

一分钟了解求助规则 | 捐赠本站 | 论文查重

更新

大幅提高文件上传限制，最高150M (2024-4-1)

更新

新增期刊收藏功能 (2024-03-23)

科研通是完全免费的文献互助平台，具备全网最快的应助速度，最高的求助完成率。对每一个文献求助，科研通都将尽心尽力，给求助人一个满意的交代。

实时播报: CC完成签到，获得积分10

2秒前; 自信的秀发完成签到，获得积分10

2秒前; centlay上传了应助文件

2秒前; 阳光友蕊完成签到，获得积分10

3秒前; 王多肉完成签到，获得积分10

4秒前; Lucas上传了应助文件

5秒前; JamesPei上传了应助文件

5秒前; 诺亚方舟哇哈哈完成签到，获得积分0

6秒前; 爱吃鱼的猫完成签到，获得积分10

6秒前; 龙丹妮子呀完成签到，获得积分10

7秒前; 个性的冥王星完成签到，获得积分10

8秒前; 秋雪瑶的应助被大气的小蜜蜂采纳，获得10

8秒前; hif1a发布了新的文献求助10

9秒前; shelemi发布了新的文献求助10

10秒前; orixero的应助被CMUSK采纳，获得10

11秒前; 酷波er的应助被Alane采纳，获得10

12秒前; 今后上传了应助文件

13秒前; Dingz完成签到，获得积分10

14秒前; hif1a完成签到，获得积分10

15秒前; 依依完成签到，获得积分10

16秒前; 总是犯错的男人完成签到，获得积分10

17秒前; ERICLEE82完成签到，获得积分10

17秒前; lalala上传了应助文件

18秒前; 钮之桃完成签到，获得积分10

18秒前; 白桃乌龙完成签到，获得积分10

18秒前; 绿蜡发布了新的文献求助200

18秒前; 浪花淘尽英雄发布了新的文献求助10

19秒前; 爱学习的树袋熊完成签到，获得积分10

19秒前; 猩心完成签到，获得积分10

19秒前; orixero上传了应助文件

20秒前; 科目三的应助被卷网那个采纳，获得10

20秒前; CodeCraft上传了应助文件

20秒前; 爱学习爱劳动完成签到，获得积分10

20秒前; 打卡下班上传了应助文件

21秒前; 巧克力酱完成签到，获得积分10

22秒前; 天天快乐的应助被爱学习的树袋熊采纳，获得10

23秒前; CMUSK发布了新的文献求助10

24秒前; shelemi完成签到，获得积分10

24秒前; tao完成签到，获得积分10

25秒前; 如意竺发布了新的文献求助10

25秒前

高分求助中: 请在求助之前详细阅读求助说明！！！！ 20000; One Man Talking: Selected Essays of Shao Xunmei, 1929–1939 1000; The Three Stars Each: The Astrolabes and Related Texts 900; Yuwu Song, Biographical Dictionary of the People's Republic of China 800; Multifunctional Agriculture, A New Paradigm for European Agriculture and Rural Development 600; Bernd Ziesemer - Maos deutscher Topagent: Wie China die Bundesrepublik eroberte 500; A radiographic standard of reference for the growing knee 400

热门求助领域（近24小时）

热门帖子: 关注科研通微信公众号，转发送积分 2478958; 求助须知：如何正确求助？哪些是违规求助？ 2141596; 关于积分的说明 5459693; 捐赠科研通 1864740; 什么是DOI，文献DOI怎么找？ 926997; 版权声明 562915; 科研通“疑难数据库（出版商）”最低求助积分说明 496023

今日热心研友

个性的紫菜

坚强的广山

紫金大萝卜

注：热心度 = 本日应助数 + 本日被采纳获取积分÷10

Copyright © 2020-2024 AbleSci.COM, 科研通, All Right Reserved

科研通是非营利科研互助平台，不忘初心，为科研助力

本站互助的所有文件仅供个人学习研究用，禁止任何人把求助的所得文献进行盈利或传播

皖ICP备2024041134号-1

皖公网安备34019202002308

科研通【文献互助QQ群】：826996720【点击一键加群】如果您有特殊求助，或发布求助超过24小时未得到应助，可加群求助

科研通【志愿服务QQ群】：如果您热爱文献互助，有热心愿意为更多人服务，请加入小伙伴群，点击申请加入

关注微信服务号

科研通