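Transition (genetics)
Psychology
Emotion recognition
Emotion classification
Computer science
Cognitive psychology
Speech recognition
Biochemistry
Gene
Chemistry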
Authors
Zihao Wang, Le Ma, Chen Zhang, Bo Han, Yunfei Xu, Yikai Wang, Xinyi Chen, Haorong Hong, Wenbo Liu, Xinda Wu, Kejun Zhang
Identifier
DOI:10.1109/taffc.2024.3486224
Abstract
Music, as a medium for emotional intervention, has important applications in scenarios such as music therapy, games, and movies. However, music must be arranged in real time as emotions change, and because the target emotion is fine-grained and mutable, it is challenging to balance real-time emotion fit against smooth emotion transition. Existing studies mainly focus on achieving real-time emotion fit, while smooth transition remains understudied, which affects the overall emotional coherence of the music. In this paper, we propose REMAST to address this trade-off. Specifically, we recognize the music emotion of the last timestep and fuse it with the input emotion of the current timestep. The fused emotion then guides REMAST to generate music based on the input melody. To flexibly adjust music similarity and real-time emotion fit, we downsample the original melody and feed it into the generation model. Furthermore, we design four music theory features based on domain knowledge to enhance emotion information, and employ semi-supervised learning to mitigate the subjective bias introduced by manual dataset annotation. According to the evaluation results, REMAST surpasses state-of-the-art methods in both objective and subjective metrics. These results demonstrate that REMAST achieves real-time fit and smooth transition simultaneously, enhancing the coherence of the generated music.
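The abstract does not say how the two emotions are fused or how the melody is downsampled, so the following is only a minimal sketch of the general idea under stated assumptions, not the REMAST implementation. It assumes an emotion is a (valence, arousal) pair, that fusion is a simple convex blend controlled by a hypothetical weight `alpha`, and that downsampling keeps every `factor`-th note of a melody given as MIDI pitch numbers. The names `fuse_emotion`, `downsample_melody`, `alpha`, and `factor` are illustrative and do not come from the paper.

```python
from typing import List, Tuple

# Assumption: an emotion is a (valence, arousal) pair; the abstract does not
# specify the representation used by REMAST.
Emotion = Tuple[float, float]


def fuse_emotion(previous: Emotion, target: Emotion, alpha: float = 0.5) -> Emotion:
    """Blend the emotion recognized at the last timestep with the current input.

    Hypothetical convex combination: alpha = 1.0 follows the target emotion
    immediately (pure real-time fit), while smaller values stay closer to the
    previous emotion (smoother transition).
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    valence = alpha * target[0] + (1.0 - alpha) * previous[0]
    arousal = alpha * target[1] + (1.0 - alpha) * previous[1]
    return (valence, arousal)


def downsample_melody(notes: List[int], factor: int) -> List[int]:
    """Keep every `factor`-th note of a melody given as MIDI pitch numbers.

    Hypothetical scheme: a larger factor passes less of the original melody
    to the generator, leaving it more freedom to follow the emotion.
    """
    if factor < 1:
        raise ValueError("factor must be >= 1")
    return notes[::factor]


if __name__ == "__main__":
    previous = (-0.8, -0.5)  # stand-in for the emotion recognized from the last segment
    target = (0.8, 0.6)      # emotion requested for the current timestep
    for step in range(4):
        # In this sketch the fused emotion stands in for the next recognized emotion.
        previous = fuse_emotion(previous, target, alpha=0.5)
        print(step, previous)
    print(downsample_melody([60, 62, 64, 65, 67, 69, 71, 72], factor=2))
```

In this sketch, `alpha` and `factor` are the two knobs that trade real-time fit against smooth transition and melodic similarity; the actual system conditions a learned generation model on the fused emotion and the downsampled melody.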