feizc/FluxMusic

Text-to-Music Generation with Rectified Flow Transformers

☆1,713

Alternatives and similar repositories for FluxMusic

Users who are interested in FluxMusic are comparing it to the libraries listed below.

ivcylc / OpenMusic
OpenMusic: SOTA Text-to-music (TTM) Generation
☆629 · Jun 26, 2025 · Updated last year
Stability-AI / stable-audio-tools
Generative models for conditional audio generation
☆3,829 · Updated this week
curtified / FluxMusicGUI
Text-to-Music Generation with Rectified Flow Transformer
☆63 · May 26, 2025 · Updated last year
multimodal-art-projection / YuE
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
☆6,345 · Jun 4, 2025 · Updated last year
QwenAudio / FunMusic
A fundamental toolkit designed for music, song, and audio generation
☆1,371 · May 20, 2025 · Updated last year
haoheliu / AudioLDM2
Text-to-Audio/Music Generation
☆2,637 · Sep 29, 2024 · Updated last year
EmilianPostolache / stable-audio-controlnet
Fine-tune Stable Audio Open with DiT ControlNet.
☆256 · May 16, 2025 · Updated last year
haidog-yaqub / EzAudio
High-quality Text-to-Audio Generation with Efficient Diffusion Transformer
☆333 · Dec 17, 2025 · Updated 7 months ago
camenduru / FluxMusic-jupyter
☆18 · Sep 4, 2024 · Updated last year
FireRedTeam / StoryMaker
StoryMaker: Towards consistent characters in text-to-image generation
☆719 · Dec 2, 2024 · Updated last year
tencent-ailab / MuQ
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
☆360 · Aug 4, 2025 · Updated 11 months ago
facebookresearch / audiobox-aesthetics
Unified automatic quality assessment for speech, music, and sound.
☆747 · Jun 5, 2025 · Updated last year
declare-lab / TangoFlux
[ICLR 2026] TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching
☆878 · Jan 28, 2026 · Updated 6 months ago
open-mmlab / Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…
☆9,969 · Mar 25, 2026 · Updated 4 months ago
jy0205 / Pyramid-Flow
[ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling
☆3,200 · Dec 21, 2024 · Updated last year
ace-step / ACE-Step
ACE-Step: A Step Towards Music Generation Foundation Model
☆4,693 · Feb 15, 2026 · Updated 5 months ago
Alpha-VLLM / Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
☆2,247 · Feb 16, 2025 · Updated last year
facebookresearch / audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…
☆23,525 · Mar 3, 2026 · Updated 4 months ago
sh-lee-prml / PeriodWave
The official Implementation of PeriodWave and PeriodWave-Turbo
☆226 · Apr 14, 2025 · Updated last year
black-forest-labs / flux
Official inference repo for FLUX.1 models
☆25,818 · Jul 31, 2025 · Updated 11 months ago
jishengpeng / WavTokenizer
[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling
☆1,309 · Mar 2, 2025 · Updated last year
Audio-AGI / WavJourney
WavJourney: Compositional Audio Creation with LLMs
☆544 · Sep 28, 2023 · Updated 2 years ago
ASLP-lab / DiffRhythm
Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion
☆2,325 · Nov 27, 2025 · Updated 8 months ago
hugofloresgarcia / vampnet
music generation with masked transformers!
☆357 · May 16, 2025 · Updated last year
SonyCSLParis / music2latent
Encode and decode audio samples to/from compressed latent representations!
☆267 · Sep 19, 2025 · Updated 10 months ago
LiuZH-19 / SongGen
[ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation
☆315 · Nov 5, 2025 · Updated 8 months ago
kyutai-labs / moshi
Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audi…
☆10,747 · May 16, 2026 · Updated 2 months ago
yizhilll / MERT
Official implementation of the paper "Acoustic Music Understanding Model with Large-Scale Self-supervised Training".
☆482 · May 25, 2025 · Updated last year
huggingface / parler-tts
Inference and training library for high-quality TTS models.
☆5,580 · Dec 10, 2024 · Updated last year
shaopengw / Awesome-Music-Generation
Awesome music generation model: MG²
☆166 · Mar 29, 2025 · Updated last year
yangdongchao / UniAudio
The Open Source Code of UniAudio
☆605 · Jul 22, 2024 · Updated 2 years ago
jasonppy / VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
☆8,506 · May 30, 2026 · Updated last month
gpt-omni / mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming…
☆3,564 · Nov 5, 2024 · Updated last year
Standard-Intelligence / hertz-dev
first base model for full-duplex conversational audio
☆1,794 · Jan 5, 2025 · Updated last year
zai-org / CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
☆12,918 · Nov 4, 2025 · Updated 8 months ago
LAION-AI / CLAP
Contrastive Language-Audio Pretraining
☆2,233 · May 15, 2025 · Updated last year
NVIDIA / audio-flamingo
PyTorch implementation of Audio Flamingo: Series of Advanced Audio Understanding Language Models
☆1,164 · Dec 15, 2025 · Updated 7 months ago
haoheliu / AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
☆2,905 · Jun 25, 2025 · Updated last year
archinetai / audio-diffusion-pytorch
Audio generation using diffusion models, in PyTorch.
☆2,095 · Jun 12, 2023 · Updated 3 years ago