yuhui1038/Muse

ACL 2026 - Muse: Towards Reproducible Long-Form Song Generation with Fine-Grained Style Control

☆119

Alternatives and similar repositories for Muse

Users who are interested in Muse are comparing it to the libraries listed below.

lsfhuihuiff / SongEcho_ICLR2026
Official code for SongEcho
☆64 · Mar 3, 2026 · Updated 4 months ago
xiquan-li / Resonate
[INTERSPEECH 2026] Pre-training, SFT, DPO and GRPO for Text-to-Audio Generation
☆48 · Apr 17, 2026 · Updated 3 months ago
Ruiqi-Yan / Awesome-Audio-Editing
A curated list of models, benchmarks, tools and guides for audio editing
☆35 · Jul 7, 2026 · Updated 3 weeks ago
Tencent / SongBench
☆51 · Apr 30, 2026 · Updated 2 months ago
CarlWangChina / QwenFeat-Vocal-Score
VocalVerse: A powerful vocal evaluation framework powered by the Qwen LLMs
☆49 · May 11, 2026 · Updated 2 months ago
Eps-Acoustic-Revolution-Lab / EAR_VAE
[INTERSPEECH 2026] This is the official implementation for εar-VAE model including inference and evaluation parts, more details coming so…
☆90 · Feb 13, 2026 · Updated 5 months ago
tencent-ailab / MuCodec
☆169 · Nov 22, 2024 · Updated last year
wx9Songs / MOSS-Music-Data-Pipeline
☆44 · Apr 26, 2026 · Updated 3 months ago
AmphionTeam / TaDiCodec
This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…
☆77 · Jan 25, 2026 · Updated 6 months ago
lonzi / mrflow_dpo
☆22 · Jan 3, 2026 · Updated 6 months ago
juhayna-zh / AudioControlNet
Official repository for the paper "Audio ControlNet for Fine-Grained Audio Generation and Editing".
☆77 · Feb 7, 2026 · Updated 5 months ago
ASLP-lab / FlashTTS
Fast Streaming TTS with MTP Acceleration and X-pred Mean Flow Distillation
☆67 · Jun 16, 2026 · Updated last month
wsntxxn / UniFlow-Audio
☆74 · Jul 17, 2026 · Updated last week
declare-lab / jamify
JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment
☆167 · Aug 7, 2025 · Updated 11 months ago
tencent-ailab / MuQ
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
☆360 · Aug 4, 2025 · Updated 11 months ago
AmphionTeam / AnyAccomp
AnyAccomp: Generalizable accompaniment generation for vocals and solo instruments, powered by a quantized melodic bottleneck.
☆39 · Dec 22, 2025 · Updated 7 months ago
ASLP-lab / SongEval
A song aesthetic evaluation toolkit trained on SongEval.
☆315 · Apr 8, 2026 · Updated 3 months ago
xiaomi-research / diffrhythm2
☆122 · Nov 6, 2025 · Updated 8 months ago
GiantAILab / YingMusic-SVC
Official implementation of YingMusic-SVC.
☆155 · Dec 29, 2025 · Updated 7 months ago
k2-fsa / Flow2GAN
Hybrid Flow Matching and GAN with Multi-Resolution Network for Few-Step High-Fidelity Audio Generation
☆146 · Mar 8, 2026 · Updated 4 months ago
xiaomi-research / dasheng-tokenizer
State-of-the-art continuous audio tokenization
☆40 · Mar 9, 2026 · Updated 4 months ago
Eps-Acoustic-Revolution-Lab / DUO_TOK
Official repository for “Duo-Tok: Dual-Track Semantic Music Tokenizer for Vocal–Accompaniment Generation.”
☆32 · Nov 26, 2025 · Updated 8 months ago
ASLP-lab / SongFormer
☆164 · May 14, 2026 · Updated 2 months ago
Audio-Foundation-Models / ConversationTTS
☆101 · Jan 19, 2026 · Updated 6 months ago
ZhikangNiu / Semantic-VAE
[INTERSPEECH 2026 Oral] Official code for "Semantic-VAE: Semantic-Alignment Latent Representation for Better Speech Synthesis"
☆121 · Jun 21, 2026 · Updated last month
xiquan-li / FineLAP
[ACL 2026 Main] FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pre-training
☆36 · Apr 20, 2026 · Updated 3 months ago
zhaoyx239 / X-Translator
☆26 · Jul 21, 2026 · Updated last week
OpenMOSS / MOSS-Audio-Tokenizer
MOSS-Audio-Tokenizer is a Causal Transformer-based audio tokenizer built on the CAT architecture. Trained on 3M hours of diverse audio, i…
☆248 · Jun 16, 2026 · Updated last month
xiquan-li / MeanAudio
[ACL 2026 Main] MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows
☆145 · Sep 2, 2025 · Updated 10 months ago
Shy-98 / MELLE
Unofficial PyTorch implementation of "Autoregressive Speech Synthesis without Vector Quantization (MELLE)"
☆41 · Jun 28, 2025 · Updated last year
OpenMOSS / MOSS-Music
MOSS-Music is an open-source music understanding model for targeting musical captioning, lyrics ASR, structural analysis, chord / key / t…
☆125 · May 9, 2026 · Updated 2 months ago
ASLP-lab / YingMusic-Singer-Plus
YingMusic-Singer-Plus: Controllable Singing Voice Synthesis with Flexible Lyric Manipulation and Annotation-free Melody Guidance
☆82 · Apr 12, 2026 · Updated 3 months ago
fundwotsai2001 / Text-to-Music_control_family
Contains SOTA methods that follow time-varying conditions for Text-to-Music
☆24 · Jan 1, 2026 · Updated 6 months ago
Andong-Li-speech / BridgeVoC
This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective".
☆67 · Nov 5, 2025 · Updated 8 months ago
zeyuxie29 / SemanticVocoder
☆28 · Apr 6, 2026 · Updated 3 months ago
tencent-ailab / SongPrep
The official code repository for SongPrep: A Preprocessing Framework and End-to-end Model for Full-song Structure Parsing and Lyrics Tran…
☆171 · Dec 8, 2025 · Updated 7 months ago
yanghaha0908 / WavCube
Official code for "WavCube: Unifying Speech Representation for Understanding and Generation via Semantic-Acoustic Joint Modeling"
☆62 · Jun 27, 2026 · Updated last month
facebookresearch / audiobox-aesthetics
Unified automatic quality assessment for speech, music, and sound.
☆747 · Jun 5, 2025 · Updated last year
facebookresearch / WavFlow
MultiModal Audio Generation in Raw Waveform Space.
☆154 · May 26, 2026 · Updated 2 months ago