tencent-ailab / SongBloom
The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement
☆746 · Updated 2 months ago
Alternatives and similar repositories for SongBloom
Users who are interested in SongBloom are comparing it to the repositories listed below.
- The official code repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment ☆1,337 · Updated 2 months ago
- A powerful 3B-parameter, LLM-based reinforcement learning audio editing model that excels at editing emotion, speaking style, and paralinguistics… ☆870 · Updated this week
- Unofficial WIP LoRA fine-tuning repository for VibeVoice ☆344 · Updated 4 months ago
- Fork of ACE-Step v1.0 for LoRA training with < 10 GB VRAM ☆63 · Updated last week
- YuE: Open Full-song Generation Foundation for the GPU Poor ☆462 · Updated 11 months ago
- Gradio UI for YuE ☆89 · Updated 10 months ago
- [ICLR 2026] TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching ☆831 · Updated 2 weeks ago
- ☆135 · Updated 11 months ago
- A lightning fast audio upsampler. ☆710 · Updated last week
- YuE with mp3 extend, exllama and GUI ☆64 · Updated 11 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer ☆329 · Updated last month
- VibeVoice: Expressive, longform conversational speech synthesis. (Community fork) ☆965 · Updated 3 weeks ago
- A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Qwen3-TTS, Cozy Voi… ☆630 · Updated this week
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation ☆299 · Updated 3 months ago
- Generative models for conditional audio generation ☆166 · Updated 2 weeks ago
- Examples of using the llasa-tts models locally ☆182 · Updated 9 months ago
- [IJCV] FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. An AI Foley master that adds vivid, synchronized sound effects to your silent videos 😝 ☆644 · Updated last year
- Repository of AudioX ☆1,132 · Updated 9 months ago
- FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers ☆501 · Updated 5 months ago
- ☆537 · Updated 4 months ago
- Echo-TTS inference codebase ☆104 · Updated 2 months ago
- ☆297 · Updated 6 months ago
- YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open ☆77 · Updated 9 months ago
- MoCha: End-to-End Video Character Replacement without Structural Guidance ☆635 · Updated 3 weeks ago
- The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran… ☆152 · Updated 2 months ago
- We present FlashPortrait, an end-to-end video diffusion transformer capable of synthesizing ID-preserving, infinite-length videos while a… ☆434 · Updated last month
- YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open ☆60 · Updated 11 months ago
- BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC ☆79 · Updated 6 months ago
- [SIGGRAPH Asia 25] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off ☆333 · Updated 3 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆105 · Updated 2 months ago