coqui-ai / TTSView external linksLinks
πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
β44,516Aug 16, 2024Updated last year
Alternatives and similar repositories for TTS
Users that are interested in TTS are comparing it to the libraries listed below
Sorting:
- π Text-Prompted Generative Audio Modelβ38,961Aug 19, 2024Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model.β35,918Apr 19, 2025Updated 9 months ago
- A multi-voice TTS system trained with an emphasis on qualityβ14,809Nov 19, 2024Updated last year
- Clone a voice in 5 seconds to generate arbitrary speech in real-timeβ59,336Dec 15, 2025Updated last month
- Robust Speech Recognition via Large-Scale Weak Supervisionβ94,315Dec 15, 2025Updated last month
- SOTA Open Source TTSβ24,863Feb 2, 2026Updated last week
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β10,111Nov 9, 2023Updated 2 years ago
- Port of OpenAI's Whisper model in C/C++β46,518Updated this week
- Faster Whisper transcription with CTranslate2β20,833Nov 19, 2025Updated 2 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β22,993Mar 13, 2025Updated 11 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,162Aug 10, 2024Updated last year
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.β103,139Updated this week
- LLM inference in C/C++β94,823Updated this week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β20,051Updated this week
- A generative speech model for daily dialogue.β38,696Jan 18, 2026Updated 3 weeks ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)β54,918Updated this week
- The definitive Web UI for local AI, with powerful features and easy setup.β46,037Feb 3, 2026Updated last week
- Stable Diffusion web UIβ160,514Dec 18, 2025Updated last month
- Get up and running with Kimi-K2.5, GLM-4.7, DeepSeek, gpt-oss, Qwen, Gemma and other models.β162,082Updated this week
- π¦π The platform for reliable agents.β126,317Updated this week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"β14,048Feb 2, 2026Updated last week
- EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engineβ8,426Aug 13, 2024Updated last year
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.β77,136May 27, 2025Updated 8 months ago
- Amphion (/Γ¦mΛfaΙͺΙn/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junioβ¦β9,687May 27, 2025Updated 8 months ago
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.β7,186Dec 24, 2024Updated last year
- Inference and training library for high-quality TTS models.β5,528Dec 10, 2024Updated last year
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β41,593Feb 2, 2026Updated last week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/β7,956Feb 11, 2024Updated 2 years ago
- A fast, local neural text to speech systemβ10,533Aug 26, 2025Updated 5 months ago
- LlamaIndex is the leading framework for building LLM-powered agents over your data.β46,977Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β39,402Jun 2, 2025Updated 8 months ago
- Industry leading face manipulation platformβ26,669Updated this week
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamicalβ¦β37,452Aug 17, 2024Updated last year
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-gβ¦β42,767Updated this week
- Foundational Models for State-of-the-Art Speech and Text Translationβ11,745Nov 14, 2024Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMsβ70,205Updated this week
- A natural language interface for computersβ62,041Dec 5, 2025Updated 2 months ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wildβ8,465Mar 15, 2025Updated 10 months ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.β19,578Updated this week