idiap/coqui-ai-TTS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/idiap/coqui-ai-TTS)

idiap / coqui-ai-TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

☆2,295

Alternatives and similar repositories for coqui-ai-TTS

Users that are interested in coqui-ai-TTS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,783Aug 16, 2024Updated last year
anhnh2002 / XTTSv2-Finetuning-for-New-Languages
View on GitHub
☆205Dec 9, 2024Updated last year
idiap / coqui-ai-Trainer
View on GitHub
🐸 - A general purpose model trainer, as flexible as it gets
☆16Apr 10, 2026Updated 3 months ago
daswer123 / xtts-api-server
View on GitHub
A simple FastAPI Server to run XTTSv2
☆595Jul 21, 2024Updated 2 years ago
erew123 / alltalk_tts
View on GitHub
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…
☆2,416Jan 9, 2026Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
SWivid / F5-TTS
View on GitHub
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆14,986Jul 5, 2026Updated 2 weeks ago
KoljaB / RealtimeTTS
View on GitHub
Converts text to speech in realtime
☆3,995May 31, 2026Updated last month
yl4579 / StyleTTS2
View on GitHub
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆6,314Aug 10, 2024Updated last year
rhasspy / piper
View on GitHub
A fast, local neural text to speech system
☆11,249Aug 26, 2025Updated 10 months ago
huggingface / parler-tts
View on GitHub
Inference and training library for high-quality TTS models.
☆5,582Dec 10, 2024Updated last year
canopyai / Orpheus-TTS
View on GitHub
Towards Human-Sounding Speech
☆6,251Dec 5, 2025Updated 7 months ago
tuanh123789 / Train_Hifigan_XTTS
View on GitHub
This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.
☆87Nov 12, 2024Updated last year
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,346Jun 9, 2026Updated last month
hexgrad / kokoro
View on GitHub
https://hf.co/hexgrad/Kokoro-82M
☆8,069Aug 6, 2025Updated 11 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
daswer123 / xtts-finetune-webui
View on GitHub
Slightly improved official version for finetune xtts
☆393Apr 3, 2025Updated last year
stlohrey / chatterbox-finetuning
View on GitHub
SoTA open-source TTS
☆136Jun 7, 2025Updated last year
astramind-ai / Auralis
View on GitHub
A Fast TTS Engine
☆626Jan 23, 2025Updated last year
daswer123 / xtts-webui
View on GitHub
Webui for using XTTS and for finetuning it
☆890Jan 17, 2025Updated last year
myshell-ai / OpenVoice
View on GitHub
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆36,984Apr 19, 2025Updated last year
myshell-ai / MeloTTS
View on GitHub
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
☆7,546Dec 24, 2024Updated last year
neonbjb / tortoise-tts
View on GitHub
A multi-voice TTS system trained with an emphasis on quality
☆14,861Nov 19, 2024Updated last year
davidbrowne17 / chatterbox-streaming
View on GitHub
Streaming and Fine-tuning for Chatterbox TTS
☆291Jun 15, 2025Updated last year
resemble-ai / chatterbox
View on GitHub
SoTA open-source TTS
☆25,614Updated this week
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
edwko / OuteTTS
View on GitHub
Interface for OuteTTS models.
☆1,435Mar 23, 2026Updated 3 months ago
SYSTRAN / faster-whisper
View on GitHub
Faster Whisper transcription with CTranslate2
☆24,424Nov 19, 2025Updated 8 months ago
coqui-ai / xtts-streaming-server
View on GitHub
☆368Jun 26, 2024Updated 2 years ago
metavoiceio / metavoice-src
View on GitHub
Foundational model for human-like, expressive TTS
☆4,203Jul 30, 2024Updated last year
suno-ai / bark
View on GitHub
🔊 Text-Prompted Generative Audio Model
☆39,204Aug 19, 2024Updated last year
rsxdalv / TTS-WebUI
View on GitHub
A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro,…
☆3,206Jul 6, 2026Updated 2 weeks ago
m-bain / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,170Jul 13, 2026Updated last week
WhisperSpeech / WhisperSpeech
View on GitHub
An Open Source text-to-speech system built by inverting Whisper.
☆4,625Dec 14, 2025Updated 7 months ago
duerig / StyleTTS2
View on GitHub
StyleTTS 2 Optimized Training Fork
☆32Feb 2, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
FunAudioLLM / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,323May 25, 2026Updated last month
matatonic / openedai-speech
View on GitHub
An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
☆860Feb 2, 2025Updated last year
shivammehta25 / Matcha-TTS
View on GitHub
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
☆1,333Jul 13, 2026Updated last week
snakers4 / silero-vad
View on GitHub
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
☆9,636Updated this week
Zyphra / Zonos
View on GitHub
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…
☆7,229Mar 5, 2025Updated last year
KoljaB / RealtimeSTT
View on GitHub
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri…
☆10,001Jun 12, 2026Updated last month
Camb-ai / MARS5-TTS
View on GitHub
MARS5 speech model (TTS) from CAMB.AI
☆2,816Aug 1, 2024Updated last year