FunAudioLLM/FunMusic

A fundamental toolkit designed for music, song, and audio generation

☆1,369

Alternatives and similar repositories for FunMusic

Users who are interested in FunMusic are comparing it to the libraries listed below.

multimodal-art-projection / YuE
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
☆6,332 · Jun 4, 2025 · Updated last year
LiuZH-19 / SongGen
[ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
☆314 · Nov 5, 2025 · Updated 8 months ago
ASLP-lab / DiffRhythm
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
☆2,321 · Nov 27, 2025 · Updated 7 months ago
tencent-ailab / MuQ
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
☆357 · Aug 4, 2025 · Updated 11 months ago
tencent-ailab / MuCodec
☆168 · Nov 22, 2024 · Updated last year
zhenye234 / X-Codec-2.0
Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis
☆360 · Jun 25, 2026 · Updated 3 weeks ago
zhenye234 / xcodec
AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model
☆308 · Oct 12, 2025 · Updated 9 months ago
facebookresearch / audiobox-aesthetics
Unified automatic quality assessment for speech, music, and sound.
☆744 · Jun 5, 2025 · Updated last year
yizhilll / MERT
Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".
☆480 · May 25, 2025 · Updated last year
declare-lab / TangoFlux
[ICLR 2026] TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
☆876 · Jan 28, 2026 · Updated 5 months ago
ace-step / ACE-Step
ACE-Step: A Step Towards Music Generation Foundation Model
☆4,670 · Feb 15, 2026 · Updated 5 months ago
EmilianPostolache / stable-audio-controlnet
Fine-tune Stable Audio Open with DiT ControlNet.
☆256 · May 16, 2025 · Updated last year
ivcylc / OpenMusic
OpenMusic: SOTA Text-to-music (TTM) Generation
☆630 · Jun 26, 2025 · Updated last year
sanderwood / clamp3
CLaMP 3: Universal Music Information Retrieval Across Unaligned Modalities and Unseen Languages [ACL 2025]
☆249 · May 11, 2025 · Updated last year
Stability-AI / stable-audio-tools
Generative models for conditional audio generation
☆3,818 · Jul 13, 2026 · Updated last week
haidog-yaqub / EzAudio
High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
☆333 · Dec 17, 2025 · Updated 7 months ago
modelscope / ClearerVoice-Studio
An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe…
☆4,315 · Aug 14, 2025 · Updated 11 months ago
Plachtaa / seed-vc
Zero-shot voice conversion & singing voice conversion, with real-time support
☆3,878 · Apr 20, 2025 · Updated last year
HeCheng0625 / Diffusion-Speech-Tokenizer
This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…
☆198 · Jan 25, 2026 · Updated 5 months ago
sanderwood / melodyt5
MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]
☆49 · Jan 23, 2025 · Updated last year
shivammehta25 / Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
☆1,333 · Jul 13, 2026 · Updated last week
NVIDIA / audio-flamingo
PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models
☆1,155 · Dec 15, 2025 · Updated 7 months ago
FireRedTeam / FireRedTTS
An Open-Sourced LLM-empowered Foundation TTS System
☆908 · Sep 28, 2025 · Updated 9 months ago
lmxue / Audio-FLAN
Audio-FLAN
☆161 · Sep 23, 2025 · Updated 9 months ago
yangdongchao / LLM-Codec
The open source code for LLM-Codec
☆147 · Aug 18, 2024 · Updated last year
descriptinc / descript-audio-codec
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
☆1,839 · Updated this week
LqNoob / Neural-Codec-and-Speech-Language-Models
Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models
☆246 · Jul 9, 2026 · Updated last week
Text-to-Audio / Make-An-Audio-3
Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers
☆121 · May 19, 2025 · Updated last year
xingchensong / S3Tokenizer
Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice
☆517 · Dec 22, 2025 · Updated 6 months ago
haoheliu / SemantiCodec-inference
Ultra-low bitrate neural audio codec (0.31~1.40 kbps) with a better semantic in the latent space.
☆254 · Mar 7, 2025 · Updated last year
xiaomi-research / diffrhythm2
☆122 · Nov 6, 2025 · Updated 8 months ago
xiquan-li / MeanAudio
[ACL 2026 Main] MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows
☆142 · Sep 2, 2025 · Updated 10 months ago
FunAudioLLM / ThinkSound
[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Tho…
☆1,373 · Apr 3, 2026 · Updated 3 months ago
ASLP-lab / SongEval
A song aesthetic evaluation toolkit trained on SongEval.
☆314 · Apr 8, 2026 · Updated 3 months ago
ZeyueT / AudioX
[ICLR 2026] Repository of AudioX
☆1,542 · Mar 10, 2026 · Updated 4 months ago
FunAudioLLM / CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,292 · May 25, 2026 · Updated last month
JishengBai / AudioSetCaps
A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline
☆208 · Dec 13, 2024 · Updated last year
ASLP-lab / MeanVC
A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows
☆296 · Jan 8, 2026 · Updated 6 months ago
yangdongchao / UniAudio
The Open Source Code of UniAudio
☆606 · Jul 22, 2024 · Updated last year