计算机科学
源分离
编码器
谐波
语音识别
人工智能
声学
物理
操作系统
作者
Wootaek Lım,Taejin Lee
标识
DOI:10.23919/eusipco.2017.8081520
摘要
Real world audio signals are generally a mixture of harmonic and percussive sounds. In this paper, we present a novel method for separating the harmonic and percussive audio signals from an audio mixture. Proposed method involves the use of a convolutional auto-encoder on a magnitude of the spectrogram to separate the harmonic and percussive signals. This network structure enables automatic high-level feature learning and spectral domain audio decomposition. An evaluation was performed using professionally produced music recording. Consequently, we confirm that the proposed method provides superior separation performance compared to conventional methods.
科研通智能强力驱动
Strongly Powered by AbleSci AI