Slightly improved official version for finetune xtts
☆384Apr 3, 2025Updated 11 months ago
Alternatives and similar repositories for xtts-finetune-webui
Users that are interested in xtts-finetune-webui are comparing it to the libraries listed below
Sorting:
- Webui for using XTTS and for finetuning it☆875Jan 17, 2025Updated last year
- A Gradio UI for XTTSv2 and RVC.☆162May 28, 2024Updated last year
- Webui for using XTTS and for finetuning it☆113Sep 22, 2024Updated last year
- A simple FastAPI Server to run XTTSv2☆575Jul 21, 2024Updated last year
- A Gradio UI for XTTSv2 and RVC.☆65Sep 26, 2024Updated last year
- Fine Tune the Style-TTS2 Voice Model☆269Jun 17, 2025Updated 8 months ago
- Slightly improved official version for finetune xtts☆70Sep 22, 2024Updated last year
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…☆2,265Jan 9, 2026Updated 2 months ago
- ☆785Jun 9, 2025Updated 9 months ago
- In this repository I will be running various experiments on finetune different parts for xtts☆15Jun 22, 2024Updated last year
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆87Nov 12, 2024Updated last year
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆389Dec 6, 2024Updated last year
- A UI for the Piper TTS☆109Aug 31, 2024Updated last year
- Diffusion_TTS extension for booga☆69Sep 6, 2025Updated 6 months ago
- A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice,…☆3,004Feb 19, 2026Updated 2 weeks ago
- ☆195Dec 9, 2024Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆258Jun 10, 2024Updated last year
- A GUI for text-to-speech processing using Kokoro ONNX☆18May 21, 2025Updated 9 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆6,196Aug 10, 2024Updated last year
- Using RVC via console or python scripts☆141Oct 18, 2024Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆14,169Updated this week
- Inference and training library for high-quality TTS models.☆5,547Dec 10, 2024Updated last year
- ☆523Feb 21, 2026Updated 2 weeks ago
- AI Search engine☆13Sep 24, 2025Updated 5 months ago
- This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for …☆13Oct 4, 2024Updated last year
- Win & Liunux Gradio WebUI for CSM-1B model by sesame☆52Mar 17, 2025Updated 11 months ago
- ☆363Jun 26, 2024Updated last year
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆2,174Feb 12, 2026Updated 3 weeks ago
- A webui for different audio related Neural Networks☆1,236May 19, 2025Updated 9 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Jul 27, 2024Updated last year
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆105Nov 19, 2025Updated 3 months ago
- A simple, high-quality voice conversion tool focused on ease of use and performance.☆3,032Updated this week
- 🚀 RVC + UVR = A perfect set of tools for voice cloning, easily and free!☆228Jul 12, 2025Updated 7 months ago
- A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS…☆736Mar 2, 2026Updated last week
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Jul 15, 2024Updated last year
- Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…☆7,193Mar 5, 2025Updated last year
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆53Mar 11, 2025Updated 11 months ago
- Foundational model for human-like, expressive TTS☆4,198Jul 30, 2024Updated last year
- A multi-voice TTS system trained with an emphasis on quality☆14,818Nov 19, 2024Updated last year