ComfyUI Implementation of Zonos Text to Speech Model
☆23Feb 19, 2025Updated last year
Alternatives and similar repositories for ComfyUI-ZonosTTS
Users that are interested in ComfyUI-ZonosTTS are comparing it to the libraries listed below
Sorting:
- Super fast multilingual speech recognition model based on Whisper Large-v3 Turbo. A node for ComfyUI.☆14May 23, 2025Updated 9 months ago
- ComfyUI-Bagel is now available in ComfyUI, BAGEL is an open‑source multimodal foundation model with 7B active parameters (14B total) trai…☆29May 28, 2025Updated 9 months ago
- This extension integrates ByteDance's UNO-FLUX model into ComfyUI, allowing you to use UNO's powerful text-to-image generation with refer…☆28Apr 17, 2025Updated 10 months ago
- ComfyUI custom_node for ByteDance's InfiniteYou☆11Apr 16, 2025Updated 10 months ago
- ComfyUI Dia text to speech☆14May 29, 2025Updated 9 months ago
- ComfyUI node to make text to speech audio with your own voices.☆73Apr 29, 2025Updated 10 months ago
- A Text To Speech node using Kokoro TTS in ComfyUI. Supports 8 languages and 150 voices☆33Jun 2, 2025Updated 9 months ago
- ComfyUI-MagnifyGlass: A powerful & customizable magnifying glass for ComfyUI. Zoom into canvas details with smooth controls, configurable…☆23Updated this week
- A powerful ComfyUI custom node that brings Google's Gemini TTS capabilities directly to your workflow. Generate high-quality speech with …☆21May 23, 2025Updated 9 months ago
- The `ComfyUI_pixtral_vision` node is a powerful ComfyUI node designed to integrate seamlessly with the Mistral Pixtral API. It facilitate…☆18Nov 20, 2024Updated last year
- 带时间戳、标点符号,自动语音识别。给视频自动添加字幕。☆28Feb 9, 2026Updated 3 weeks ago
- Extract LoRA from the original Fine-Tuned model. 从微调模型中提取lora。☆20May 5, 2025Updated 10 months ago
- Portrait Tools: Facial detection cropping, alignment, ID photo, etc☆19Jun 15, 2025Updated 8 months ago
- Using Spark-TTS in Comfyui. Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens☆51May 23, 2025Updated 9 months ago
- A ComfyUI custom node that integrates Mistral AI's Pixtral Large vision model, enabling powerful multimodal AI capabilities within ComfyU…☆21Jul 21, 2025Updated 7 months ago
- A comprehensive ComfyUI wrapper for HiggsAudio v2, enabling high-quality text-to-speech generation with advanced voice cloning capabiliti…☆27Jul 26, 2025Updated 7 months ago
- ComfyUI custom nodes to create a speech dataset☆21Jun 17, 2025Updated 8 months ago
- Image adjustments node for ComfyUI. Contrast, gamma, saturation, hue rotate, R, G and B channel offset, filter color, sharpness, unsharp …☆42Jul 10, 2025Updated 7 months ago
- This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audi…☆23Jan 12, 2025Updated last year
- An End-to-End Pipeline for Enhanced French Text-to-Speech with SSML Prosody Control☆31Jan 13, 2026Updated last month
- A quick port of Resynthesizer (the Gimp plug-in for content aware fill) to ComfyUI.☆30Jul 25, 2025Updated 7 months ago
- A custom node for ComfyUI that integrates DeepSeek's R1 powerful chat and instruction API, enabling seamless AI interactions within your …☆19Jan 27, 2025Updated last year
- ComfyUI Translation Nodes: XiaoMi GemmaX, QuickMT etc.☆27May 30, 2025Updated 9 months ago
- Comfyui-SadTalker☆22Oct 16, 2025Updated 4 months ago
- A node in comfyui for one-click assisted prompt generation (for image and video generation, etc.).☆24Jul 7, 2025Updated 7 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆231May 28, 2025Updated 9 months ago
- Implementation of CSM from SesameAILabs☆33Jan 22, 2026Updated last month
- A voice conversion extension node for ComfyUI based on FreeVC, enabling high-quality voice conversion capabilities within the ComfyUI fra…☆67Apr 3, 2025Updated 11 months ago
- About DeepSeek Chat API☆34Feb 23, 2025Updated last year
- A ComfyUI node containing multiple audio processing tools.☆87Jul 7, 2025Updated 7 months ago
- a custom comfyui node for coqui-ai/TTS's xtts module! support 17 languages voice cloning and tts☆66Jun 24, 2024Updated last year
- Seed-VC voice or sing conversion.☆55Jun 11, 2025Updated 8 months ago
- ComfyUI nodes for transcription on audio or video input.☆29Apr 23, 2025Updated 10 months ago
- A ComfyUI custom node extension that integrates the Janus-Pro-7B vision-language model from DeepSeek AI, enabling powerful image understa…☆31Mar 20, 2025Updated 11 months ago
- An advanced custom node for ComfyUI that provides optimized access to Wan2.1, a state-of-the-art video foundation model suite. The WanVid…☆38Feb 27, 2025Updated last year
- Continual Resilient (CoRe) Optimizer for PyTorch☆11Jun 10, 2024Updated last year
- A count down clock to embed in reveal.js presentations.☆11Jan 6, 2023Updated 3 years ago
- ☆43Mar 27, 2025Updated 11 months ago
- A ComfyUI extension for OmniGen2☆48Jul 1, 2025Updated 8 months ago