NeuralFalconYT / kokoro_v1
☆27Updated last month
Alternatives and similar repositories for kokoro_v1
Users that are interested in kokoro_v1 are comparing it to the libraries listed below
Sorting:
- ☆36Updated 3 months ago
- ☆66Updated last month
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆93Updated last month
- Examples of using the llasa-tts models locally☆172Updated 3 weeks ago
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i…☆176Updated this week
- just unzip and use it with gradio☆50Updated 3 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆78Updated 7 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆46Updated 4 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆61Updated last month
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆35Updated 6 months ago
- Collection of the best Applio plugins.☆29Updated 8 months ago
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆39Updated 2 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆59Updated 6 months ago
- A custom node wrapper for Kokoro TTS for ComfyUI☆25Updated last month
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆73Updated last month
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM☆31Updated 2 weeks ago
- Win & Liunux Gradio WebUI for CSM-1B model by sesame☆43Updated 2 months ago
- ☆96Updated last year
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion☆10Updated 7 months ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.☆41Updated last year
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆97Updated this week
- This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audi…☆16Updated 4 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆30Updated this week
- API server for Instant voice cloning by MyShell.☆92Updated 7 months ago
- ⚡ AI Avatar Factory is an interface for creating and managing AI avatars. ⚡☆56Updated this week
- A Gradio UI for XTTSv2 and RVC.☆68Updated 7 months ago
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆204Updated 10 months ago
- Free ComfyUI Workflows☆22Updated 2 months ago
- Gradio UI for YuE☆47Updated last month
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆23Updated last month