billwuhao / ComfyUI_SparkTTSView external linksLinks
Using Spark-TTS in Comfyui. Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens
☆51May 23, 2025Updated 8 months ago
Alternatives and similar repositories for ComfyUI_SparkTTS
Users that are interested in ComfyUI_SparkTTS are comparing it to the libraries listed below
Sorting:
- A Text To Speech node using Kokoro TTS in ComfyUI. Supports 8 languages and 150 voices☆32Jun 2, 2025Updated 8 months ago
- Super fast multilingual speech recognition model based on Whisper Large-v3 Turbo. A node for ComfyUI.☆14May 23, 2025Updated 8 months ago
- ComfyUI Translation Nodes: XiaoMi GemmaX, QuickMT etc.☆27May 30, 2025Updated 8 months ago
- A ComfyUI node containing multiple audio processing tools.☆84Jul 7, 2025Updated 7 months ago
- ComfyUI custom_node for ByteDance's InfiniteYou☆11Apr 16, 2025Updated 9 months ago
- ComfyUI-PosterCraft is now available in ComfyUI, PosterCraft is a unified framework for high-quality aesthetic poster generation that exc…☆18Jun 26, 2025Updated 7 months ago
- ComfyUI-SparkTTS is a custom ComfyUI node implementation of SparkTTS, an advanced text-to-speech system that harnesses the power of large…☆124Apr 15, 2025Updated 9 months ago
- A Text To Speech node using Step-Audio-TTS in ComfyUI. Can speak, rap, sing, or clone voice.☆163May 23, 2025Updated 8 months ago
- ☆66Nov 14, 2024Updated last year
- ComfyUI Custom Nodes for "TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching". This generates high-quality 44…☆104Mar 28, 2025Updated 10 months ago
- Lightweight and Efficient, 🎧Ultra High-Quality Voice Cloning, Chinese and English.☆208Jun 11, 2025Updated 8 months ago
- 一个用于展示的节点☆37Mar 6, 2025Updated 11 months ago
- Unofficial implementation of LatentSync in ComfyUI☆17Jan 6, 2025Updated last year
- ComfyUI node that generates animated dotted waveform visualizations from audio input with multiple animation styles including teardrop-sh…☆29Nov 12, 2025Updated 3 months ago
- The simple implementation of the recraft sticker function☆128Sep 5, 2025Updated 5 months ago
- CosyVoice2 for ComfyUI☆167May 20, 2025Updated 8 months ago
- ComfyUI Implementation of Zonos Text to Speech Model☆23Feb 19, 2025Updated 11 months ago
- ComfyUI-Bagel is now available in ComfyUI, BAGEL is an open‑source multimodal foundation model with 7B active parameters (14B total) trai…☆29May 28, 2025Updated 8 months ago
- 带时间戳、标点符号,自动语音识别。给视频自动添加字幕。☆28Updated this week
- a custom node for separation vocals from music based on Music-Source-Separation-Training☆24Oct 24, 2024Updated last year
- Prompt Generator for Video, Audio, Image, and Text. A node for ComfyUI. Including Deepseek, Alibaba Cloud Qwen, Google Gemini, and locall…☆53Jul 11, 2025Updated 7 months ago
- ☆36Aug 10, 2025Updated 6 months ago
- KV-Edit: Training-Free Image Editing for Precise Background Preservation,you can use it in comfyUI☆61Sep 30, 2025Updated 4 months ago
- Use ‘DICE-Talk’ in ComfyUI,which is a method about 'Correlation-Aware Emotional Talking Portrait Generation'.☆25May 7, 2025Updated 9 months ago
- The successful integration of Qwen3-VL-Instruct series into the ComfyUI platform has enabled a smooth operation, supporting (but not limi…☆73Jan 5, 2026Updated last month
- ntegrate Topaz Photo AI's powerful image enhancement capabilities directly into your ComfyUI workflows.☆17May 24, 2025Updated 8 months ago
- A custom ComfyUI node for MiniCPM vision-language models, supporting v4, v4.5, and v4 GGUF formats, enabling high-quality image captionin…☆144Aug 28, 2025Updated 5 months ago
- The OminiControl plugin for ComfyUI☆141Dec 20, 2024Updated last year
- ☆94Apr 5, 2025Updated 10 months ago
- This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audi…☆934Sep 4, 2025Updated 5 months ago
- A voice conversion extension node for ComfyUI based on FreeVC, enabling high-quality voice conversion capabilities within the ComfyUI fra…☆66Apr 3, 2025Updated 10 months ago
- Amphion-MaskGCT:0-sample voice synthesis and OpenAI-whisper-large-v3:Speech-to-text ComfyUI node packaging☆27Mar 5, 2025Updated 11 months ago
- A node for ComfyUI that performs GPEN face restoration on the input image(s). Significantly faster than other implementations of GPEN.☆67Apr 15, 2025Updated 9 months ago
- ☆11Jan 13, 2025Updated last year
- Creates prompts for Video Models by sequence analysis and prompting using Qwen2.5-VL models from Alibaba.☆53Apr 2, 2025Updated 10 months ago
- 🔥🔥🔥 Support TeaCache acceleration for 2x faster inference with minimal quality loss☆50May 6, 2025Updated 9 months ago
- You can apply makeup to the characters in comfyui☆102Jul 3, 2025Updated 7 months ago
- This is a ComfyUI plug-in for lllyasviel/FramePack, easy to use☆192May 5, 2025Updated 9 months ago
- ☆186Apr 17, 2025Updated 9 months ago