tencent-ailab/SongGeneration

The official code repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment

☆1,654

Alternatives and similar repositories for SongGeneration

Users who are interested in SongGeneration are comparing it to the libraries listed below.

smthemex / ComfyUI_SongGeneration
SongGeneration: High-Quality Song Generation with Multi-Preference Alignment (SOTA), you can try VRAM>12G
☆158 · Mar 21, 2026 · Updated 2 months ago
ASLP-lab / DiffRhythm
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
☆2,308 · Nov 27, 2025 · Updated 6 months ago
ace-step / ACE-Step
ACE-Step: A Step Towards Music Generation Foundation Model
☆4,565 · Feb 15, 2026 · Updated 4 months ago
FunAudioLLM / FunMusic
A fundamental toolkit designed for music, song, and audio generation
☆1,355 · May 20, 2025 · Updated last year
tencent-ailab / SongBloom
The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement
☆784 · Dec 4, 2025 · Updated 6 months ago
LiuZH-19 / SongGen
[ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
☆312 · Nov 5, 2025 · Updated 7 months ago
tencent-ailab / MuQ
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
☆347 · Aug 4, 2025 · Updated 10 months ago
multimodal-art-projection / YuE
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
☆6,277 · Jun 4, 2025 · Updated last year
Stability-AI / stable-audio-tools
Generative models for conditional audio generation
☆3,766 · May 26, 2026 · Updated 2 weeks ago
xiaomi-research / diffrhythm2
☆120 · Nov 6, 2025 · Updated 7 months ago
woct0rdho / ACE-Step
Fork of ACE-Step v1.0 for LoRA training with < 10 GB VRAM
☆68 · Feb 3, 2026 · Updated 4 months ago
FunAudioLLM / ThinkSound
[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Tho…
☆1,366 · Apr 3, 2026 · Updated 2 months ago
facebookresearch / audiobox-aesthetics
Unified automatic quality assessment for speech, music, and sound.
☆727 · Jun 5, 2025 · Updated last year
tencent-ailab / SongPrep
The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran…
☆160 · Dec 8, 2025 · Updated 6 months ago
tencent-ailab / MuCodec
☆162 · Nov 22, 2024 · Updated last year
ASLP-lab / SongEval
A song aesthetic evaluation toolkit trained on SongEval.
☆307 · Apr 8, 2026 · Updated 2 months ago
Audio-Foundation-Models / ConversationTTS
☆101 · Jan 19, 2026 · Updated 4 months ago
meaningTeam / tidy-tunes
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆23 · May 19, 2026 · Updated 3 weeks ago
yuhui1038 / Muse
ACL 2026 - Muse: Towards Reproducible Long-Form Song Generation with Fine-Grained Style Control
☆118 · Apr 11, 2026 · Updated 2 months ago
Plachtaa / seed-vc
zero-shot voice conversion & singing voice conversion, with real-time support
☆3,802 · Apr 20, 2025 · Updated last year
MeiGen-AI / MultiTalk
[NeurIPS 2025] Let Them Talk: Audio-Driven Multi-Person Conversational Video Generation
☆2,945 · May 22, 2026 · Updated 3 weeks ago
jiaqili3 / DualCodec
[Interspeech 2025] DualCodec: A Low-Frame-Rate, Semantically-Enhanced Neural Audio Codec
☆68 · Mar 11, 2026 · Updated 3 months ago
zengchang233 / xiaoicesing2
The source code for the paper XiaoiceSing2 (Interspeech 2023)
☆49 · Jan 15, 2024 · Updated 2 years ago
caizexin / GenVC
Self-supervised Generative LM-based Voice Conversion
☆58 · Apr 24, 2025 · Updated last year
ASLP-lab / MeanVC
A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows
☆280 · Jan 8, 2026 · Updated 5 months ago
ZeyueT / AudioX
[ICLR 2026] Repository of AudioX
☆1,524 · Mar 10, 2026 · Updated 3 months ago
declare-lab / TangoFlux
[ICLR 2026] TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
☆868 · Jan 28, 2026 · Updated 4 months ago
KdaiP / StableTTS
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
☆437 · Sep 13, 2024 · Updated last year
freds0 / free-svc
[ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion
☆94 · Jul 23, 2025 · Updated 10 months ago
qiuqiangkong / audioflow
☆121 · Updated this week
zhenye234 / X-Codec-2.0
Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
☆358 · Jul 21, 2025 · Updated 10 months ago
WX-Wei / HarmoF0
☆107 · Aug 23, 2024 · Updated last year
LqNoob / Neural-Codec-and-Speech-Language-Models
Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models
☆242 · Dec 18, 2025 · Updated 5 months ago
Fantasy-AMAP / fantasy-talking
[ACM MM 2025] FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis
☆1,621 · Jan 26, 2026 · Updated 4 months ago
RickyL-2000 / AlignSTS
Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment
☆68 · Jul 5, 2024 · Updated last year
hkchengrex / MMAudio
[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
☆2,206 · Feb 23, 2026 · Updated 3 months ago
descriptinc / descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
☆1,814 · Jan 26, 2026 · Updated 4 months ago
open-mmlab / Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…
☆9,837 · Mar 25, 2026 · Updated 2 months ago
JusperLee / Gull-Codec-Training
☆12 · Mar 11, 2025 · Updated last year