parlance-zz / dualdiffusionLinks
Dual Diffusion is a generative diffusion model for music trained on video game soundtracks.
☆79Updated this week
Alternatives and similar repositories for dualdiffusion
Users that are interested in dualdiffusion are comparing it to the libraries listed below
Sorting:
- Flexible LoRA Implementation to use with stable-audio-tools☆79Updated last year
- Fine-tune Stable Audio Open with DiT ControlNet.☆249Updated 7 months ago
- Generative models for conditional audio generation☆165Updated 10 months ago
- Fine-tune your own MusicGen with LoRA☆153Updated last year
- ☆184Updated last month
- Decked-out gradio client for audio diffusion, mainly stable-audio-tools.☆38Updated 2 months ago
- Awesome music generation model——MG²☆165Updated 9 months ago
- ☆87Updated 2 years ago
- YuE with mp3 extend, exllama and GUI☆64Updated 10 months ago
- ☆83Updated last year
- Trainer for audio-diffusion-pytorch☆130Updated 2 years ago
- Refactored / updated version of `stable-audio-tools` which is an open-source code for audio/music generative models originally by Stabili…☆214Updated last year
- [SOTA] [92% acc] 786M-8k-44L-32H multi-instrumental music transformer with true full MIDI instruments range, efficient encoding, octo-vel…☆88Updated last year
- fine-tuning MusicGen without prompts to generate music with a specific style☆67Updated 2 years ago
- Sing an idea ➡️ AI music sample🔥🎶☆119Updated last year
- a notebook containing scripts, documentation, and examples for finetuning musicgen☆99Updated last year
- ☆107Updated 2 years ago
- tools to manipulate audio with riffusion☆95Updated 2 years ago
- A novel diffusion-based model for synthesizing long-context, high-fidelity music efficiently.☆195Updated 2 years ago
- Anticipatory Autoregressive Models☆181Updated 2 months ago
- [ICASSP'24] Investigating Personalization Methods in Text to Music Generation☆44Updated last year
- Text-to-Music Generation with Rectified Flow Transformer☆64Updated 7 months ago
- ☆171Updated 2 years ago
- Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generati…☆190Updated last year
- A collection of pre-trained audio models, in PyTorch.☆114Updated 2 years ago
- ☆51Updated last year
- Fork of ACE-Step for LoRA training with < 10 GB VRAM☆59Updated last month
- Encode and decode audio samples to/from compressed latent representations!☆241Updated 3 months ago
- Metrics for evaluating music and audio generative models – with a focus on long-form, full-band, and stereo generations.☆273Updated 3 weeks ago
- open soundstream-ish VAE codecs for downstream neural audio synthesis☆120Updated 2 years ago