mirbehnam / Kokoro-TTS-windows
Just unzip and use it with Gradio.
☆36 Updated last month
Alternatives and similar repositories for Kokoro-TTS-windows:
Users who are interested in Kokoro-TTS-windows are comparing it to the libraries listed below
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i… ☆146 Updated this week
- ☆36 Updated last month
- ☆59 Updated last week
- A Gradio UI for XTTSv2 and RVC. ☆156 Updated 10 months ago
- ☆403 Updated 3 weeks ago
- Webui for using XTTS and for finetuning it ☆111 Updated 6 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) in Python (Gradio interface). Translated on … ☆94 Updated this week
- YuE: Open Full-song Generation Foundation for the GPU Poor ☆350 Updated last month
- A Text To Speech node using Kokoro TTS in ComfyUI ☆40 Updated last week
- Sesame Converse - Real Time Conversations - Powered by Gemma 3 ☆58 Updated last week
- Slightly improved official version for finetune xtts ☆71 Updated 6 months ago
- ☆103 Updated 2 weeks ago
- A Gradio UI for XTTSv2 and RVC. ☆68 Updated 6 months ago
- ☆28 Updated 3 months ago
- Examples of using the llasa-tts models locally ☆158 Updated last month
- An image viewer and AI-assisted editing/captioning/masking tool that helps with curating datasets for generative AI models, finetunes and… ☆106 Updated last week
- Run Local and API LLMs, Features Gemini2 image generation, DEEPSEEK R1, QwenVL2.5, QWQ32B, Ollama, LlamaCPP LMstudio, Koboldcpp, TextGen,… ☆105 Updated last week
- Prompt-based Evolutionary Nudity Iteration System ☆113 Updated 3 weeks ago
- AI powered speech denoising and enhancement. Adapted for windows and optimized ☆82 Updated 8 months ago
- joy-caption-alpha-two -cli mod and gui mod ☆65 Updated 5 months ago
- Slightly improved official version for finetune xtts ☆326 Updated 5 months ago
- YuE: Open Full-song Generation Foundation Model, something similar to Suno.ai but open ☆49 Updated 3 weeks ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆33 Updated 4 months ago
- gguf node for comfyui ☆32 Updated this week
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier ☆29 Updated 3 months ago
- Win & Linux Gradio WebUI for CSM-1B model by sesame ☆34 Updated last week
- A Colab for the FluxGym Lora Training repository. ☆61 Updated 3 months ago
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation ☆201 Updated 9 months ago
- This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audi… ☆13 Updated 2 months ago
- ☆20 Updated last week