ace-step / ACE-Step
ACE-Step: A Step Towards Music Generation Foundation Model
☆936Updated this week
Alternatives and similar repositories for ACE-Step:
Users that are interested in ACE-Step are comparing it to the libraries listed below
- ☆108Updated last month
- Examples of using the llasa-tts models locally☆168Updated 2 weeks ago
- YuE: Open Full-song Generation Foundation for the GPU Poor☆383Updated 2 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆268Updated 3 weeks ago
- Generative models for conditional audio generation☆151Updated 3 months ago
- ☆254Updated this week
- Gradio UI for YuE☆45Updated last month
- ☆96Updated last year
- Awesome music generation model——MG²☆154Updated last month
- YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open☆60Updated last week
- ☆748Updated last week
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆324Updated 2 weeks ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆89Updated last month
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆546Updated last month
- YuE with mp3 extend, exllama and GUI☆48Updated 2 months ago
- ☆221Updated last month
- Run Orpheus 3B Locally With LM Studio☆392Updated last month
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆716Updated 2 months ago
- G2P☆227Updated last week
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆141Updated last month
- OpenMusic: SOTA Text-to-music (TTM) Generation☆557Updated last week
- Text-to-Music Generation with Rectified Flow Transformer☆62Updated 8 months ago
- Adding timestamped prompts and general quality of life features to FramePack https://discord.gg/MtuM7gFJ3V☆125Updated this week
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆61Updated last month
- InspireMusic: A Unified Framework for Music, Song, Audio Generation.☆1,081Updated this week
- Interface for OuteTTS models.☆1,209Updated last week
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆132Updated 3 weeks ago
- The New Stable Diffusion Audio Sampler 1.0 In a ComfyUI Node. Make some beats!☆251Updated 4 months ago
- Symbolic Music Generation, NotaGen node for ComfyUI.☆37Updated this week
- HunyuanVideo GP: Large Video Generation Model - GPU Poor version☆404Updated last month