Cypress-Yang / SongBloomLinks
☆287Updated 2 months ago
Alternatives and similar repositories for SongBloom
Users that are interested in SongBloom are comparing it to the libraries listed below
Sorting:
- Fork of ACE-Step for LoRA training with < 10 GB VRAM☆35Updated 3 weeks ago
- YuE with mp3 extend, exllama and GUI☆58Updated 6 months ago
- BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC☆76Updated last month
- JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment☆88Updated last month
- ☆278Updated last month
- ☆734Updated last month
- Awesome music generation model——MG²☆159Updated 5 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆308Updated 2 months ago
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation☆258Updated 2 months ago
- YuE: Open Full-song Generation Foundation for the GPU Poor☆437Updated 7 months ago
- Gradio UI for YuE☆72Updated 5 months ago
- ☆181Updated 8 months ago
- Examples of using the llasa-tts models locally☆180Updated 4 months ago
- ☆75Updated last year
- Fine-tune Stable Audio Open with DiT ControlNet.☆245Updated 3 months ago
- Long-form conversational TTS | Community fork☆204Updated last week
- A ComfyUI custom node integration for multi-engine multi-language High-quality Text-to-Speech and Voice Conversion. Supports: RVC, Chatte…☆143Updated this week
- ☆124Updated 6 months ago
- YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open☆70Updated 4 months ago
- Text-to-Music Generation with Rectified Flow Transformer☆64Updated 3 months ago
- ☆81Updated 10 months ago
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆176Updated last year
- Generative models for conditional audio generation☆161Updated 7 months ago
- [CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation☆290Updated 3 months ago
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆781Updated last month
- [ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.☆383Updated 2 months ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers☆454Updated 3 weeks ago
- Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).☆113Updated 7 months ago
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆46Updated last year
- Mobius: Text to Seamless Looping Video Generation via Latent Shift☆164Updated 4 months ago