woct0rdho / ACE-StepLinks
Fork of ACE-Step for LoRA training with < 10 GB VRAM
☆18Updated last week
Alternatives and similar repositories for ACE-Step
Users that are interested in ACE-Step are comparing it to the libraries listed below
Sorting:
- ☆113Updated this week
- YuE with mp3 extend, exllama and GUI☆53Updated 3 months ago
- Awesome music generation model——MG²☆157Updated 2 months ago
- ☆78Updated 8 months ago
- Gradio UI for YuE☆58Updated 2 months ago
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆46Updated 9 months ago
- Flexible LoRA Implementation to use with stable-audio-tools☆72Updated 9 months ago
- Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).☆110Updated 4 months ago
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆60Updated last month
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆74Updated 8 months ago
- ☆60Updated this week
- GUI for the new musubi-tuner☆37Updated 4 months ago
- the comfyui custom node for UVR5 to separate vocals and background music☆95Updated last year
- FoleyCrafter is a video-to-audio generation framework which can produce realistic sound effects semantically relevant and synchronized wi…☆61Updated 3 weeks ago
- Music production for silent film clips.☆25Updated last month
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆101Updated 5 months ago
- ☆174Updated 5 months ago
- ☆75Updated last year
- ☆80Updated 3 months ago
- Gradio UI for training video models using finetrainers☆30Updated 2 months ago
- A comprehensive codebase for training and finetuning Image <> Latent models.☆35Updated 3 months ago
- Symbolic Music Generation, NotaGen node for ComfyUI.☆45Updated 2 weeks ago
- Fine-tune Stable Audio Open with DiT ControlNet.☆232Updated last month
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆165Updated last year
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆99Updated last month
- ☆75Updated last week
- ☆36Updated 4 months ago
- YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open☆66Updated last month
- This is a pre-built wheel of Triton 3.3.0 for Windows with Nvidia only + Proton☆26Updated last month
- This is a simple ComfyUI custom TTS node based on Parler_tts.☆44Updated 5 months ago