计算机科学
发电机(电路理论)
磁道(磁盘驱动器)
生成语法
一致性(知识库)
趋同(经济学)
人工智能
节奏
机器学习
语音识别
操作系统
功率(物理)
哲学
物理
美学
量子力学
经济
经济增长
作者
Luyu Chen,Sheng Huang Lin,Dian Yu,Zhihua Wang,Kun Qian,Bin Hu,Björn W. Schuller,Yoshiharu Yamamoto
标识
DOI:10.1109/gcce59613.2023.10315503
摘要
Music generation with artificial intelligence is a complex and captivating task. The utilisation of generative adversarial networks (GANs) has exhibited promising outcomes in producing realistic and diverse music compositions. In this paper, we propose a model based on Wasserstein GAN with gradient penalty (WGAN-GP) for multi-track music generation. This model incorporates self-attention and introduces a novel cross-attention mechanism in the generator to enhance its expressive capability. Additionally, we transpose all music to C major in training to ensure data consistency and quality. Experimental results demonstrate that our model can produce multi-track music with enhanced rhythm and sound characteristics, accelerate convergence, and improve generation quality.
科研通智能强力驱动
Strongly Powered by AbleSci AI