A Text To Speech node using Step-Audio-TTS in ComfyUI. Can speak, rap, sing, or clone voice.
☆163May 23, 2025Updated 10 months ago
Alternatives and similar repositories for ComfyUI_StepAudioTTS
Users that are interested in ComfyUI_StepAudioTTS are comparing it to the libraries listed below.
- Using Spark-TTS in ComfyUI. Spark-TTS: An Efficient LLM-Based Text-to-Speech Model with Single-Stream Decoupled Speech Tokens☆51May 23, 2025Updated 10 months ago
- A ComfyUI node containing multiple audio processing tools.☆90Jul 7, 2025Updated 8 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆237May 28, 2025Updated 9 months ago
- Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation. A node for ComfyUI.☆149May 30, 2025Updated 9 months ago
- To make ComfyUI easier to use, I have made some optimizations and integrations to some commonly used nodes.☆10Mar 3, 2025Updated last year
- This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audi…☆941Sep 4, 2025Updated 6 months ago
- A Text To Speech node using Kokoro TTS in ComfyUI. Supports 8 languages and 150 voices☆33Jun 2, 2025Updated 9 months ago
- YuE is a groundbreaking series of open-source foundation models designed for music generation, specifically for transforming lyrics into …☆185Feb 24, 2025Updated last year
- Lightweight and Efficient, 🎧Ultra High-Quality Voice Cloning, Chinese and English.☆209Jun 11, 2025Updated 9 months ago
- Official Implementation of Attention Distillation for ComfyUI☆110Mar 18, 2025Updated last year
- ☆143Dec 14, 2025Updated 3 months ago
- Sonic is a method described in 'Shifting Focus to Global Audio Perception in Portrait Animation'; you can use it in ComfyUI☆1,130Sep 27, 2025Updated 5 months ago
- ☆186Apr 17, 2025Updated 11 months ago
- ComfyUI Hunyuan3D-1-wrapper is a custom node that allows you to run Tencent/Hunyuan3D-1 in ComfyUI as a wrapper.☆33Nov 13, 2024Updated last year
- ComfyUI Custom Nodes for "TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching". This generates high-quality 44…☆104Mar 28, 2025Updated 11 months ago
- ComfyUI_Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation☆73Mar 12, 2025Updated last year
- ☆248May 14, 2025Updated 10 months ago
- Memory-Guided Diffusion for Expressive Talking Video Generation☆173Mar 9, 2025Updated last year
- A simple implementation of real-time 3D lighting in ComfyUI☆213Jun 16, 2025Updated 9 months ago
- ZenID FaceSwap☆218Jul 3, 2025Updated 8 months ago
- ☆244Apr 3, 2025Updated 11 months ago
- ComfyUI nodes to run FLUX OminiControl using diffusers☆39Dec 8, 2024Updated last year
- The OminiControl plugin for ComfyUI☆143Dec 20, 2024Updated last year
- JoyCaption ComfyUI Nodes☆120Feb 25, 2026Updated 3 weeks ago
- Super fast multilingual speech recognition model based on Whisper Large-v3 Turbo. A node for ComfyUI.☆14May 23, 2025Updated 10 months ago
- Unofficial custom_node for AnyText v1.1: https://github.com/tyxsspa/AnyText and AnyText v2.0: https://github.com/tyxsspa/AnyText2 and Gly…☆99May 28, 2025Updated 9 months ago
- ComfyUI node for F5-Text To Speech☆258Feb 3, 2026Updated last month
- A set of comfyui multi class nodes☆15Sep 3, 2025Updated 6 months ago
- Transcribe audio and add subtitles to videos using Whisper in ComfyUI☆218Jan 2, 2026Updated 2 months ago
- The nodes detached from [ComfyUI Layer Style](https://github.com/chflame163/ComfyUI_LayerStyle) are mainly those with complex requirement…☆627Feb 22, 2026Updated last month
- A simple 3D model processing tool within ComfyUI☆23Oct 18, 2024Updated last year
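- A ComfyUI extension node that adds a variety of polished artistic text effects to your images, with support for a rich set of text styles and effects.☆30Mar 21, 2025Updated last year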
- Generative Motion Latent Flow Matching for Audio-driven Talking Portrait☆264Jan 2, 2026Updated 2 months ago
- ☆463Jun 22, 2025Updated 9 months ago
- ☆90Jun 18, 2025Updated 9 months ago
- This node is based on MykolaL/StableDesign☆15Aug 7, 2025Updated 7 months ago
- MuseTalk audio driven face inpainting☆69May 21, 2024Updated last year
- Advanced Vision Model Loader for Comfy UI☆260Mar 6, 2025Updated last year
- A ComfyUI custom node designed for advanced image background removal and object, face, clothes, and fashion segmentation, utilizing multi…☆1,823Feb 3, 2026Updated last month