diodiogod / TTS-Audio-SuiteLinks
A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, IndexTTS-2, Chatterbox (classic and multilingual 23-lang), F5-TTS, Higgs Audio 2 and Microsoft VibeVoice with unlimited text length, SRT timing, Character support, Audio Analyzer, Silent Speech Analyzer, audio edit and more
☆416Updated last week
Alternatives and similar repositories for TTS-Audio-Suite
Users that are interested in TTS-Audio-Suite are comparing it to the libraries listed below
Sorting:
- ComfyUI node for F5-Text To Speech☆241Updated 3 weeks ago
- TTS + Voice Cloning☆172Updated 3 months ago
- An ComfyUI custom node integration for multi-language High-quality Text-to-Speech and Voice Conversion nodes using ResembleAI's Chatterbo…☆77Updated 3 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆189Updated 6 months ago
- ☆471Updated 2 months ago
- ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio☆514Updated 2 months ago
- ☆138Updated 3 months ago
- A Text To Speech node using Kokoro TTS in ComfyUI☆64Updated 8 months ago
- Enable true multi gpu capability in Comfy UI using XDiT XFuser and FSDP managed by Ray☆214Updated this week
- Generative Motion Latent Flow Matching for Audio-driven Talking Portrait☆239Updated 3 months ago
- ComfyUI custom nodes and web utilities for real-time AI generation and interaction☆314Updated 3 weeks ago
- ComfyUI Chatterbox TTS & Voice Conversion Node☆67Updated 3 months ago
- ☆30Updated 2 weeks ago
- ☆130Updated 8 months ago
- YuE: Open Full-song Generation Foundation for the GPU Poor☆450Updated 9 months ago
- ComfyUI Wrapper for HiDream☆482Updated 7 months ago
- ComfyUI extension that enables multi-GPU processing locally, remotely and in the cloud☆417Updated last month
- A custom node wrapper for Kokoro TTS for ComfyUI☆40Updated 2 months ago
- ComfyUI Wrapper for HiDream - 4bit linux loading fix☆98Updated 6 months ago
- YuE is a groundbreaking series of open-source foundation models designed for music generation, specifically for transforming lyrics into …☆166Updated 9 months ago
- ComfyUI nodes for WanAnimate model input preprocessing☆327Updated last month
- Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation. A node for ComfyUI.☆140Updated 5 months ago
- ☆189Updated 6 months ago
- 3D x AI hybrid editor, built with three.js☆107Updated 2 months ago
- This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audi…☆23Updated 10 months ago
- ComfyUI workflow customization by Jake.☆127Updated this week
- SoTA open-source TTS for Audiobook and Podcast Generation☆172Updated 5 months ago
- Run Local and API LLMs, Features Gemini2 image generation, DEEPSEEK R1, QwenVL2.5, QWQ32B, Ollama, LlamaCPP LMstudio, Koboldcpp, TextGen,…☆144Updated 7 months ago
- AI-api text generation☆138Updated 2 months ago
- A pipeline parallel training script for diffusion models.☆147Updated 5 months ago