ASLP-lab/DiffRhythm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ASLP-lab/DiffRhythm)

ASLP-lab / DiffRhythm

Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

☆2,321

Alternatives and similar repositories for DiffRhythm

Users that are interested in DiffRhythm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

FunAudioLLM / FunMusic
View on GitHub
A fundamental toolkit designed for music, song, and audio generation
☆1,369May 20, 2025Updated last year
ASLP-lab / DiffRhythm2
View on GitHub
Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching
☆166Nov 9, 2025Updated 8 months ago
multimodal-art-projection / YuE
View on GitHub
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
☆6,332Jun 4, 2025Updated last year
ace-step / ACE-Step
View on GitHub
ACE-Step: A Step Towards Music Generation Foundation Model
☆4,670Feb 15, 2026Updated 5 months ago
ASLP-lab / SongEval
View on GitHub
A song aesthetic evaluation toolkit trained on SongEval.
☆314Apr 8, 2026Updated 3 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
LiuZH-19 / SongGen
View on GitHub
[ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
☆314Nov 5, 2025Updated 8 months ago
billwuhao / ComfyUI_DiffRhythm
View on GitHub
Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation. A node for ComfyUI.
☆153May 30, 2025Updated last year
tencent-ailab / MuQ
View on GitHub
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
☆357Aug 4, 2025Updated 11 months ago
ASLP-lab / MeanVC
View on GitHub
A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows
☆296Jan 8, 2026Updated 6 months ago
ASLP-lab / VoiceSculptor
View on GitHub
An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.
☆250Feb 26, 2026Updated 4 months ago
ASLP-lab / MINT-Bench
View on GitHub
☆48May 2, 2026Updated 2 months ago
tencent-ailab / MuCodec
View on GitHub
☆168Nov 22, 2024Updated last year
xiaomi-research / diffrhythm2
View on GitHub
☆122Nov 6, 2025Updated 8 months ago
ElectricAlexis / NotaGen
View on GitHub
NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms
☆1,212Apr 21, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
declare-lab / jamify
View on GitHub
JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment
☆167Aug 7, 2025Updated 11 months ago
ASLP-lab / Speaker-Reasoner
View on GitHub
Speaker-Reasoner: Scaling Interaction Turns and Reasoning Patterns for Timestamped Speaker-Attributed ASR
☆93May 13, 2026Updated 2 months ago
ASLP-lab / OSUM
View on GitHub
OSUM & OSUM-EChat, open speech understanding model and empathetic spoken chatbot based on it, open-sourced by ASLP@NPU.
☆494Nov 23, 2025Updated 7 months ago
facebookresearch / audiobox-aesthetics
View on GitHub
Unified automatic quality assessment for speech, music, and sound.
☆744Jun 5, 2025Updated last year
Stability-AI / stable-audio-tools
View on GitHub
Generative models for conditional audio generation
☆3,818Jul 13, 2026Updated last week
ivcylc / OpenMusic
View on GitHub
OpenMusic: SOTA Text-to-music (TTM) Generation
☆630Jun 26, 2025Updated last year
declare-lab / TangoFlux
View on GitHub
[ICLR 2026] TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
☆876Jan 28, 2026Updated 5 months ago
ZeyueT / AudioX
View on GitHub
[ICLR 2026] Repository of AudioX
☆1,542Mar 10, 2026Updated 4 months ago
ASLP-lab / LLaSA_Plus
View on GitHub
Llasa Speed Up
☆64Jan 18, 2026Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
zhenye234 / X-Codec-2.0
View on GitHub
Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
☆360Jun 25, 2026Updated 3 weeks ago
ASLP-lab / WenetSpeech-Wu-Repo
View on GitHub
A Large-scale Wu Dialect Speech Corpus with Multi-dimensional Annotations
☆170Feb 6, 2026Updated 5 months ago
ASLP-lab / SongFormer
View on GitHub
☆164May 14, 2026Updated 2 months ago
Tencent / SongBench
View on GitHub
☆50Apr 30, 2026Updated 2 months ago
Plachtaa / seed-vc
View on GitHub
zero-shot voice conversion & singing voice conversion, with real-time support
☆3,878Apr 20, 2025Updated last year
facebookresearch / FlowDec
View on GitHub
An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.
☆212Jun 22, 2026Updated 3 weeks ago
Stability-AI / stable-codec
View on GitHub
A family of state-of-the-art Transformer-based audio codecs for low-bitrate high-quality audio coding.
☆436Updated this week
NVIDIA / audio-flamingo
View on GitHub
PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models
☆1,155Dec 15, 2025Updated 7 months ago
sanderwood / clamp3
View on GitHub
CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]
☆249May 11, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
hkchengrex / MMAudio
View on GitHub
[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis
☆2,241Feb 23, 2026Updated 4 months ago
gwx314 / TechSinger
View on GitHub
TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching
☆100Apr 2, 2026Updated 3 months ago
ASLP-lab / LLaSE-G1
View on GitHub
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
☆47Mar 10, 2025Updated last year
KdaiP / StableTTS
View on GitHub
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
☆438Sep 13, 2024Updated last year
magenta / magenta-realtime
View on GitHub
Magenta RealTime 2: An Open-Weights Live Music Model
☆1,688Updated this week
descriptinc / descript-audio-codec
View on GitHub
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
☆1,839Updated this week
MrSupW / ContextASR-Bench
View on GitHub
A Massive Contextual Speech Recognition Benchmark.
☆107Aug 6, 2025Updated 11 months ago