ace-step / ACE-Step-1.5
The most powerful local music generation model that outperforms most commercial alternatives
☆1,959 Updated this week
Alternatives and similar repositories for ACE-Step-1.5
Users who are interested in ACE-Step-1.5 are comparing it to the repositories listed below.
- A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice … ☆1,354 Updated 4 months ago
- The official code repository for LeVo: High-Quality Song Generation with Multi-Preference Alignment ☆1,337 Updated last month
- Official Python inference and LoRA trainer package for the LTX-2 audio–video generative model. ☆3,485 Updated last week
- VibeVoice: Expressive, longform conversational speech synthesis. (Community fork) ☆964 Updated 2 weeks ago
- ☆979 Updated last month
- Suno-like music generation studio for HeartMuLa/heartlib - AI-powered music creation with reference audio style transfer ☆434 Updated last week
- A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime. ☆632 Updated last week
- PersonaLive! : Expressive Portrait Image Animation for Live Streaming ☆1,612 Updated last month
- The official code repository for SongBloom: Coherent Song Generation via Interleaved Autoregressive Sketching and Diffusion Refinement ☆746 Updated 2 months ago
- [ICLR 26] Stable Video Infinity: Infinite-Length Video Generation with Error Recycling ☆1,961 Updated 3 weeks ago
- ACE-Step: A Step Towards Music Generation Foundation Model ☆3,873 Updated last week
- YuE: Open Full-song Generation Foundation for the GPU Poor ☆462 Updated 11 months ago
- ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio ☆555 Updated 4 months ago
- A Simple Implementation of Qwen3-TTS's ComfyUI ☆949 Updated this week
- A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Qwen3-TTS, Cozy Voi… ☆616 Updated this week
- A lightning fast audio upsampler. ☆697 Updated this week
- Soprano: Instant, Ultra-Realistic Text-to-Speech ☆1,164 Updated 3 weeks ago
- 🎵 The Ultimate Open Source Suno Alternative - Professional UI for ACE-Step 1.5 AI Music Generation. Free, local, unlimited. Stop paying … ☆312 Updated this week
- NotaGen: Advancing Musicality in Symbolic Music Generation with Large Language Model Training Paradigms ☆1,155 Updated 9 months ago
- The repository provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio), links for downloading the trai… ☆3,256 Updated last month
- TTS model capable of streaming conversational audio in realtime. ☆1,051 Updated 2 months ago
- A powerful 3B-parameter, LLM-based Reinforcement Learning audio edit model excels at editing emotion, speaking style, and paralinguistics… ☆862 Updated last week
- A ComfyUI custom node suite for Qwen3-TTS, supporting 1.7B and 0.6B models, Custom Voice, Voice Design, Voice Cloning and Fine-Tuning. ☆153 Updated last week
- LTX-Video Support for ComfyUI ☆3,095 Updated last week
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning ☆5,715 Updated 2 weeks ago
- A TTS that fits in your CPU (and pocket) ☆2,995 Updated this week
- Modified version of Chatterbox that accepts text files as input and no character restrictions. I use it to make audiobooks, especially fo… ☆516 Updated 5 months ago
- Visual Novel Character Creation Suite is a comprehensive tool for creating character sprites for visual novels. It allows you to create u… ☆722 Updated this week
- ☆1,592 Updated 2 months ago
- ComfyUI node for highly expressive speech and realistic zero-shot voice cloning ☆380 Updated last month