📄
Abstract - Generative Modeling of Bach-Style Symbolic Music: A Comparative Study of Autoregressive, Latent-Variable, and Adversarial Approaches
We study generative modeling of Bach-style symbolic piano music using a shared MIDI corpus and three model families: autoregressive LSTMs with attention, latent-variable models including recurrent VAEs and vector-quantized VAEs, and generative adversarial networks. We compare their ability to model polyphonic note sequences, learn useful latent representations, and generate stylistically coherent compositions. Our experiments show that the autoregressive LSTM with attention produces the most musically coherent samples, while vector quantization helps mitigate posterior collapse and yields more structured outputs than conventional recurrent VAEs. The adversarial approach captures local pitch patterns but remains difficult to train and generalizes less reliably to Bach's style. These results highlight the relative strengths and failure modes of autoregressive, latent-variable, and adversarial approaches for symbolic music generation.
巴赫风格符号音乐的生成建模:自回归、潜变量与对抗方法的比较研究 /
Generative Modeling of Bach-Style Symbolic Music: A Comparative Study of Autoregressive, Latent-Variable, and Adversarial Approaches
1️⃣ 一句话总结
本文使用巴赫风格的钢琴MIDI数据,比较了三种生成模型(带注意力的自回归LSTM、潜变量模型如VAE和向量量化VAE、以及生成对抗网络),发现自回归模型生成的音乐最连贯,向量量化能解决潜变量模型的后验坍缩问题,而对抗网络虽然能捕捉局部音符模式但难以稳定训练。