billwuhao / ComfyUI_StepAudioTTSLinks
A Text To Speech node using Step-Audio-TTS in ComfyUI. Can speak, rap, sing, or clone voice.
☆163Updated 8 months ago
Alternatives and similar repositories for ComfyUI_StepAudioTTS
Users that are interested in ComfyUI_StepAudioTTS are comparing it to the libraries listed below
Sorting:
- YuE is a groundbreaking series of open-source foundation models designed for music generation, specifically for transforming lyrics into …☆184Updated 11 months ago
- a comfyui custom node for CosyVoice☆287Updated last year
- A ComfyUI node containing multiple audio processing tools.☆82Updated 7 months ago
- Lightweight and Efficient, 🎧Ultra High-Quality Voice Cloning, Chinese and English.☆208Updated 7 months ago
- DiffuEraser is a diffusion model for video Inpainting, you can use it in ComfyUI☆238Updated 2 months ago
- ☆66Updated last year
- CosyVoice2 for ComfyUI☆166Updated 8 months ago
- Some useful custom nodes that are not included in ComfyUI core yet☆97Updated 9 months ago
- Generative Motion Latent Flow Matching for Audio-driven Talking Portrait☆242Updated last month
- This is a ComfyUI plugin that makes it easier to call and run workflows from RunningHub in your local ComfyUI setup.☆209Updated 3 months ago
- CogVideoX-5B 4-bit quantization model☆110Updated last year
- Generate detailed image descriptions and analysis using Molmo models in ComfyUI.☆139Updated last year
- ComfyUI-SparkTTS is a custom ComfyUI node implementation of SparkTTS, an advanced text-to-speech system that harnesses the power of large…☆124Updated 9 months ago
- The implementation of MiniCPM-V-4_5 has been seamlessly integrated into the ComfyUI platform, enabling the support for text-based querie…☆253Updated 5 months ago
- 本仓库用FFmpeg在ComfyUI上实现各种视频处理(This repository uses FFmpeg to implement various video processing tasks on ComfyUI.)☆133Updated 5 months ago
- ☆210Updated last year
- A voice conversion extension node for ComfyUI based on FreeVC, enabling high-quality voice conversion capabilities within the ComfyUI fra…☆66Updated 10 months ago
- Using Spark-TTS in Comfyui. Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens☆51Updated 8 months ago
- Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation. A node for ComfyUI.☆149Updated 8 months ago
- This is a ComfyUI-Windows implementation of the image animation projects -> UniAnimate and Animate-X.☆185Updated 4 months ago
- Memory-Guided Diffusion for Expressive Talking Video Generation☆173Updated 11 months ago
- An wrapper for Turbodiffusion to support 100-200x fast video generations.☆210Updated last month
- This repository provides the official ComfyUI workflow for ICEdit.☆204Updated 6 months ago
- The successful integration of Qwen3-VL-Instruct series into the ComfyUI platform has enabled a smooth operation, supporting (but not limi…☆509Updated 3 months ago
- Prompt Generator for Video, Audio, Image, and Text. A node for ComfyUI. Including Deepseek, Alibaba Cloud Qwen, Google Gemini, and locall…☆53Updated 6 months ago
- You can call Using Sapiens to get seg,normal,pose,depth,mask☆200Updated 10 months ago
- Seed-VC voice or sing conversion.☆56Updated 7 months ago
- ☆223Updated 4 months ago
- ☆131Updated last year
- An improved wrapper for the FramePack project that allows the creation of videos of any length based on a reference images and LoRA's wit…☆112Updated 8 months ago