Cypress-Yang / SongBloomLinks

☆287

Alternatives and similar repositories for SongBloom

Users that are interested in SongBloom are comparing it to the libraries listed below

Sorting:

woct0rdho / ACE-Step
Fork of ACE-Step for LoRA training with < 10 GB VRAM
☆35Updated 3 weeks ago
Mozer / YuE-extend
YuE with mp3 extend, exllama and GUI
☆58Updated 6 months ago
Bill13579 / beltout
BeltOut: An open source pitch-perfect voice-to-voice timbre transfer model based on ChatterboxVC
☆76Updated last month
declare-lab / jamify
JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment
☆88Updated last month
yl4579 / DMOSpeech2
☆278Updated last month
tencent-ailab / SongGeneration
☆734Updated last month
shaopengw / Awesome-Music-Generation
Awesome music generation model——MG²
☆159Updated 5 months ago
haidog-yaqub / EzAudio
High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
☆308Updated 2 months ago
LiuZH-19 / SongGen
[ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
☆258Updated 2 months ago
deepbeepmeep / YuEGP
YuE: Open Full-song Generation Foundation for the GPU Poor
☆437Updated 7 months ago
joeljuvel / YuE-UI
Gradio UI for YuE
☆72Updated 5 months ago
HilaManor / AudioEditingCode
☆181Updated 8 months ago
nivibilla / local-llasa-tts
Examples of using the llasa-tts models locally
☆180Updated 4 months ago
Yusiissy / SonicVisionLM
☆75Updated last year
EmilianPostolache / stable-audio-controlnet
Fine-tune Stable Audio Open with DiT ControlNet.
☆245Updated 3 months ago
vibevoice-community / VibeVoice
Long-form conversational TTS | Community fork
☆204Updated last week
diodiogod / TTS-Audio-Suite
A ComfyUI custom node integration for multi-engine multi-language High-quality Text-to-Speech and Voice Conversion. Supports: RVC, Chatte…
☆143Updated this week
sgsdxzy / YuE-exllamav2
☆124Updated 6 months ago
alisson-anjos / YuE-Interface
YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open
☆70Updated 4 months ago
curtified / FluxMusicGUI
Text-to-Music Generation with Rectified Flow Transformer
☆64Updated 3 months ago
YatingMusic / MusiConGen
☆81Updated 10 months ago
bytedance / Make-An-Audio-2
a text-conditional diffusion probabilistic model capable of generating high fidelity audio.
☆176Updated last year
RoyalCities / RC-stable-audio-tools
Generative models for conditional audio generation
☆161Updated 7 months ago
Tencent-Hunyuan / HunyuanPortrait
[CVPR-2025] The official code of HunyuanPortrait: Implicit Condition Control for Enhanced Portrait Animation
☆290Updated 3 months ago
declare-lab / TangoFlux
TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
☆781Updated last month
deepbrainai-research / float
[ICCV 2025] Official Pytorch Implementation of FLOAT: Generative Motion Latent Flow Matching for Audio-driven Talking Portrait.
☆383Updated 2 months ago
Fantasy-AMAP / fantasy-portrait
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers
☆454Updated 3 weeks ago
cyanbx / Prompt-Singer
Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).
☆113Updated 7 months ago
yannqi / Draw-an-Audio-Code
Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.
☆46Updated last year
YisuiTT / Mobius
Mobius: Text to Seamless Looping Video Generation via Latent Shift
☆164Updated 4 months ago