Patrick-Ric / kokoro-tts-guiLinks
A GUI for text-to-speech processing using Kokoro ONNX
☆18Updated 8 months ago
Alternatives and similar repositories for kokoro-tts-gui
Users that are interested in kokoro-tts-gui are comparing it to the libraries listed below
Sorting:
- SoTA open-source TTS for Audiobook and Podcast Generation☆183Updated 7 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆37Updated 8 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆105Updated 2 months ago
- A random walk voice style cloning application for Kokoro text to speech☆202Updated 7 months ago
- OminiControl for the GPU Poor☆39Updated 11 months ago
- An extension to use Kokoro TTS in text generation webui☆21Updated 8 months ago
- Quantized text-audio foundation model from Boson AI☆43Updated 5 months ago
- An ComfyUI custom node integration for multi-language High-quality Text-to-Speech and Voice Conversion nodes using ResembleAI's Chatterbo…☆78Updated 4 months ago
- Additional non-node based UI for ComfyUI focused on inference. Stable UI states; presets; and advanced queue. Based on Gradio☆92Updated this week
- Examples of using the llasa-tts models locally☆182Updated 9 months ago
- Stable Diffusion GUI written in C++☆85Updated 3 months ago
- ☆58Updated last year
- A collection of compiled wheels for deepspeed built for python 3.10 and 3.11 with support for cuda 11.8 and 12.1 for Windows☆86Updated last year
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM☆75Updated 8 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆107Updated 2 weeks ago
- ☆72Updated 5 months ago
- ☆100Updated last year
- Creative Image Enhancer/Upscaler. Powered By Refiners. 8GB VRAM | 10GB Install☆47Updated last month
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark☆69Updated 6 months ago
- ☆135Updated 10 months ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆20Updated 2 years ago
- A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Cozy Voice 3, Step …☆572Updated this week
- ☆73Updated 10 months ago
- ☆75Updated last month
- 2D-to-3D image generator and viewer: https://tiefling.app☆117Updated 5 months ago
- (NVIDIA) FramePack is a next-frame (next-frame-section) prediction neural network structure that generates videos progressively.☆20Updated last month
- ComfyUI wrapper for Kokoro-onnx☆36Updated last year
- SoTA open-source TTS☆147Updated last month
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i…☆251Updated last month
- A Gradio UI for XTTSv2 and RVC.☆161Updated last year