Chord(对等)
计算机科学
语音识别
感知器
音乐信息检索
Mel倒谱
人工智能
特征提取
人工神经网络
数据库
艺术
音乐剧
视觉艺术
作者
Weite Feng,Junrui Liu,Tong Li,Zhen Yang,Di Wu
标识
DOI:10.1142/s0218194022500577
摘要
Music content has recently been identified as useful information to promote the performance of music recommendations. Existing studies usually feed low-level audio features, such as the Mel-frequency cepstral coefficients, into deep learning models for music recommendations. However, such features cannot well characterize music audios, which often contain multiple sound sources. In this paper, we propose to model and fuse chord, melody, and rhythm features to meaningfully characterize the music so as to improve the music recommendation. Specially, we use two user-based attention mechanisms to differentiate the importance of different parts of audio features and chord features. In addition, a Long Short-Term Memory layer is used to capture the sequence characteristics. Those features are fused by a multilayer perceptron and then used to make recommendations. We conducted experiments with a subset of the last.fm-1b dataset. The experimental results show that our proposal outperforms the best baseline by [Formula: see text] on HR@10.
科研通智能强力驱动
Strongly Powered by AbleSci AI