A simple FastAPI Server to run XTTSv2
☆593Jul 21, 2024Updated last year
Alternatives and similar repositories for xtts-api-server
Users that are interested in xtts-api-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Webui for using XTTS and for finetuning it☆891Jan 17, 2025Updated last year
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…☆2,394Jan 9, 2026Updated 5 months ago
- Slightly improved official version for finetune xtts☆392Apr 3, 2025Updated last year
- A Gradio UI for XTTSv2 and RVC.☆159May 28, 2024Updated 2 years ago
- [OBSOLETE] Extensions API for SillyTavern.☆685Dec 10, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆368Jun 26, 2024Updated 2 years ago
- Quick hack job to allow use with Sillytavern. This works for me, some further updates are expected to expose more settings to sillytavern☆11May 30, 2024Updated 2 years ago
- Converts text to speech in realtime☆3,971May 31, 2026Updated last month
- ☆73Aug 2, 2025Updated 11 months ago
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆403Dec 6, 2024Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆6,295Aug 10, 2024Updated last year
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆2,276Jun 10, 2026Updated 3 weeks ago
- Launcher scripts for SillyTavern and ST-Extras.☆538Jun 2, 2026Updated last month
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆724Jun 17, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Using RVC via console or python scripts☆152Oct 18, 2024Updated last year
- ☆30Apr 8, 2024Updated 2 years ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.☆858Feb 2, 2025Updated last year
- Text to Speech using Coqui TTS + RVC☆113Nov 30, 2025Updated 7 months ago
- A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro,…☆3,186May 14, 2026Updated last month
- 111 VRM Animation Pack: For use with Silly Tavern☆80Apr 11, 2026Updated 2 months ago
- Diffusion_TTS extension for booga☆71Sep 6, 2025Updated 9 months ago
- Run GGUF models easily with a KoboldAI UI. One File. Zero Install.☆10,887Updated this week
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆713Jul 5, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- XTTSv2 Extension for oobabooga text-generation-webui☆157Nov 21, 2023Updated 2 years ago
- Oobabooga extension for Bark TTS☆119Nov 23, 2023Updated 2 years ago
- An Open Source text-to-speech system built by inverting Whisper.☆4,620Dec 14, 2025Updated 6 months ago
- MirrorMetrics: How to evaluate Stable Diffusion LoRAs. A visual diagnostic tool to detect overfitting, check dataset quality, and fix tra…☆58Feb 21, 2026Updated 4 months ago
- Loader extension for tabbyAPI in SillyTavern☆27Jun 30, 2025Updated last year
- Amica is an open source interface for interactive communication with 3D characters with voice synthesis and speech recognition.☆1,557Jul 23, 2025Updated 11 months ago
- Large-scale LLM inference engine☆1,777May 8, 2026Updated last month
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆45,642Aug 16, 2024Updated last year
- LLM Frontend for Power Users.☆29,994May 20, 2026Updated last month
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/multiplatform CPU, AMD, NVIDIA GPU PyTorch support, handling, and auto-s…☆5,094Jun 18, 2026Updated 2 weeks ago
- A fast, local neural text to speech system☆11,184Aug 26, 2025Updated 10 months ago
- Foundational model for human-like, expressive TTS☆4,200Jul 30, 2024Updated last year
- Fine Tune the Style-TTS2 Voice Model☆267Jun 17, 2025Updated last year
- In this repository I will be running various experiments on finetune different parts for xtts☆15Jun 22, 2024Updated 2 years ago
- A simple, high-quality voice conversion tool focused on ease of use and performance.☆3,434Updated this week
- an auto-sleeping and -waking framework around llama.cpp☆13Feb 8, 2025Updated last year