Cypress-Yang / SongBloomLinks
☆277Updated last month
Alternatives and similar repositories for SongBloom
Users that are interested in SongBloom are comparing it to the libraries listed below
Sorting:
- Fork of ACE-Step for LoRA training with < 10 GB VRAM☆35Updated last week
- YuE with mp3 extend, exllama and GUI☆58Updated 6 months ago
- BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC☆71Updated last month
- Awesome music generation model——MG²☆159Updated 4 months ago
- JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment☆82Updated 2 weeks ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆308Updated last month
- Gradio UI for YuE☆70Updated 4 months ago
- ☆270Updated last month
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation☆255Updated last month
- ☆706Updated 2 weeks ago
- YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open☆70Updated 3 months ago
- ☆181Updated 7 months ago
- Text-to-Music Generation with Rectified Flow Transformer☆64Updated 2 months ago
- ☆75Updated last year
- Examples of using the llasa-tts models locally☆179Updated 4 months ago
- YuE: Open Full-song Generation Foundation for the GPU Poor☆433Updated 6 months ago
- ☆120Updated 5 months ago
- Fine-tune Stable Audio Open with DiT ControlNet.☆243Updated 3 months ago
- ☆79Updated 10 months ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆398Updated this week
- ICCV 2025 ACTalker: an end-to-end video diffusion framework for talking head synthesis that supports both single and multi-signal control…☆375Updated this week
- Mobius: Text to Seamless Looping Video Generation via Latent Shift☆163Updated 3 months ago
- Generative models for conditional audio generation☆159Updated 6 months ago
- [CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation☆288Updated 2 months ago
- Official PyTorch Implementation of "Optimal Stepsize for Diffusion Sampling".☆185Updated 4 months ago
- Official code for AccVideo: Accelerating Video Diffusion Model with Synthetic Dataset☆257Updated 2 months ago
- Unlimited-length talking video generation that supports image-to-video and video-to-video generation☆185Updated this week
- We achieves high-quality first-frame guided video editing given a reference image, while maintaining flexibility for incorporating additi…☆299Updated last week
- [Official] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off☆279Updated last week
- [ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.☆369Updated last month