tencent-ailab/SongBloom

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/tencent-ailab/SongBloom)

tencent-ailab / SongBloom

The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement

☆784

Alternatives and similar repositories for SongBloom

Users that are interested in SongBloom are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

tencent-ailab / MuQ
View on GitHub
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
☆347Aug 4, 2025Updated 10 months ago
ace-step / ACE-Step
View on GitHub
ACE-Step: A Step Towards Music Generation Foundation Model
☆4,581Feb 15, 2026Updated 4 months ago
tencent-ailab / MuCodec
View on GitHub
☆162Nov 22, 2024Updated last year
tencent-ailab / SongGeneration
View on GitHub
The official code repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment
☆1,654Mar 12, 2026Updated 3 months ago
LiuZH-19 / SongGen
View on GitHub
[ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
☆312Nov 5, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ASLP-lab / DiffRhythm
View on GitHub
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
☆2,308Nov 27, 2025Updated 6 months ago
FunAudioLLM / FunMusic
View on GitHub
A fundamental toolkit designed for music, song, and audio generation
☆1,359May 20, 2025Updated last year
Kunbyte-AI / DRA-Ctrl
View on GitHub
Official Implementation of DRA-Ctrl (Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis)
☆119Aug 15, 2025Updated 10 months ago
declare-lab / jamify
View on GitHub
JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment
☆164Aug 7, 2025Updated 10 months ago
tencent-ailab / SongPrep
View on GitHub
The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran…
☆161Dec 8, 2025Updated 6 months ago
neph1 / finetrainers-ui
View on GitHub
Gradio UI for training video models using finetrainers
☆33Apr 18, 2025Updated last year
AMAAI-Lab / SonicVerse
View on GitHub
SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning
☆53Jul 28, 2025Updated 10 months ago
ASLP-lab / SongEval
View on GitHub
A song aesthetic evaluation toolkit trained on SongEval.
☆309Apr 8, 2026Updated 2 months ago
Yukun-Huang / DreamCube
View on GitHub
[ICCV 2025] Official implementation of the paper "DreamCube: 3D Panorama Generation via Multi-plane Synchronization".
☆178Feb 4, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
sanderwood / clamp3
View on GitHub
CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]
☆245May 11, 2025Updated last year
ZhikangNiu / A-DMA
View on GitHub
[INTERSPEECH 2025 Oral]Official code for "Accelerating Diffusion-based Text-to-Speech Model Training with Dual Modality Alignment"
☆67Jun 16, 2025Updated last year
sanderwood / melodyt5
View on GitHub
MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]
☆47Jan 23, 2025Updated last year
Mddct / transformer-vocos
View on GitHub
☆36Sep 6, 2025Updated 9 months ago
ZhikangNiu / Semantic-VAE
View on GitHub
[INTERSPEECH 2026]Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"
☆112Jun 3, 2026Updated 2 weeks ago
facebookresearch / audiobox-aesthetics
View on GitHub
Unified automatic quality assessment for speech, music, and sound.
☆727Jun 5, 2025Updated last year
minzwon / musicfm
View on GitHub
☆263Feb 14, 2024Updated 2 years ago
multimodal-art-projection / YuE
View on GitHub
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
☆6,277Jun 4, 2025Updated last year
rishikksh20 / MiniMax-TTS-pytorch
View on GitHub
Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report
☆47Sep 2, 2025Updated 9 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
bytedance / Make-An-Audio-2
View on GitHub
a text-conditional diffusion probabilistic model capable of generating high fidelity audio.
☆198May 29, 2024Updated 2 years ago
zengchang233 / xiaoicesing2
View on GitHub
The source code for the paper XiaoiceSing2 (interspeech2023)
☆49Jan 15, 2024Updated 2 years ago
Plachtaa / ASTRAL-quantization
View on GitHub
speaker-disentangled speech linguistic content quantizer
☆25Mar 19, 2025Updated last year
zhenye234 / xcodec
View on GitHub
AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
☆304Oct 12, 2025Updated 8 months ago
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
declare-lab / TangoFlux
View on GitHub
[ICLR 2026] TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
☆868Jan 28, 2026Updated 4 months ago
flamed-tts / Flamed-TTS
View on GitHub
This repository implement a novel zero-shot TTS framework, named Flamed-TTS, focusing on the efficient generation and dynamic pacing in …
☆57Aug 9, 2025Updated 10 months ago
KdaiP / StableTTS
View on GitHub
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
☆437Sep 13, 2024Updated last year
Audio-Foundation-Models / ConversationTTS
View on GitHub
☆101Jan 19, 2026Updated 4 months ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
CarlWangChina / QwenFeat-Vocal-Score
View on GitHub
VocalVerse: A powerful vocal evaluation framework powered by the Qwen LLMs
☆47May 11, 2026Updated last month
lmxue / Audio-FLAN
View on GitHub
Audio-FLAN
☆161Sep 23, 2025Updated 8 months ago
yuhui1038 / Muse
View on GitHub
ACL 2026 - Muse: Towards Reproducible Long-Form Song Generation with Fine-Grained Style Control
☆118Apr 11, 2026Updated 2 months ago
adelacvg / DPTTS
View on GitHub
An AR+AR TTS attempt.
☆18Jan 13, 2025Updated last year
anima-x / anima-x
View on GitHub
Official Implementation of [AnimaX: Animating the Inanimate in 3D with Joint Video-Pose Diffusion Models]
☆296Jun 25, 2025Updated 11 months ago
ZeyueT / AudioX
View on GitHub
[ICLR 2026] Repository of AudioX
☆1,524Mar 10, 2026Updated 3 months ago
Fantasy-AMAP / fantasy-portrait
View on GitHub
FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers
☆508Aug 20, 2025Updated 9 months ago