Music Generation Resources
Papers
- HuBERT: Self-Supervised Speech Representation Learning by Masked Prediction of Hidden Units
- Robust Speech Recognition via Large-Scale Weak Supervision
- Enabling Factorized Piano Music Modeling and Generation with the MAESTRO Dataset
- RAVE: A variational autoencoder for fast and high-quality neural audio synthesis
- Long-form music generation with latent diffusion
- Do Music Generation Models Encode Music Theory?
- Multi-Track MusicLDM: Towards Versatile Music Generation with Latent Diffusion Model
- Audio Prompt Adapter: Unleashing Music Editing Abilities for Text-to-Music with Lightweight Finetuning
- Simple and Controllable Music Generation
- WaveGlow: A Flow-based Generative Network for Speech Synthesis
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
- Improving Controllability and Editability for Pretrained Text-to-Music Generation Models