πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
β44,763Aug 16, 2024Updated last year
Alternatives and similar repositories for TTS
Users that are interested in TTS are comparing it to the libraries listed below
Sorting:
- π Text-Prompted Generative Audio Modelβ39,043Aug 19, 2024Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model.β36,049Apr 19, 2025Updated 10 months ago
- A multi-voice TTS system trained with an emphasis on qualityβ14,820Nov 19, 2024Updated last year
- Clone a voice in 5 seconds to generate arbitrary speech in real-timeβ59,512Updated this week
- Robust Speech Recognition via Large-Scale Weak Supervisionβ95,527Dec 15, 2025Updated 2 months ago
- SOTA Open Source TTSβ25,154Mar 5, 2026Updated last week
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β10,122Nov 9, 2023Updated 2 years ago
- Port of OpenAI's Whisper model in C/C++β47,262Mar 5, 2026Updated last week
- Faster Whisper transcription with CTranslate2β21,289Nov 19, 2025Updated 3 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β23,048Mar 3, 2026Updated last week
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,196Aug 10, 2024Updated last year
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.β105,651Updated this week
- LLM inference in C/C++β97,252Updated this week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β20,556Feb 22, 2026Updated 2 weeks ago
- A generative speech model for daily dialogue.β38,905Jan 18, 2026Updated last month
- The best local UI for large language models, with easy setup and powerful features. 100% offline.β46,214Updated this week
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)β55,605Feb 9, 2026Updated last month
- Stable Diffusion web UIβ161,629Mar 2, 2026Updated last week
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.β164,248Mar 6, 2026Updated last week
- The agent engineering platformβ128,595Updated this week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"β14,169Mar 4, 2026Updated last week
- EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engineβ8,454Aug 13, 2024Updated last year
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.β77,210May 27, 2025Updated 9 months ago
- Amphion (/Γ¦mΛfaΙͺΙn/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junioβ¦β9,709May 27, 2025Updated 9 months ago
- Inference and training library for high-quality TTS models.β5,547Dec 10, 2024Updated last year
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.β7,242Dec 24, 2024Updated last year
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β42,001Updated this week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/β7,955Feb 11, 2024Updated 2 years ago
- A fast, local neural text to speech systemβ10,633Aug 26, 2025Updated 6 months ago
- LlamaIndex is the leading document agent and OCR platformβ47,608Updated this week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β39,418Jun 2, 2025Updated 9 months ago
- Industry leading face manipulation platformβ26,995Mar 5, 2026Updated last week
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamicalβ¦β37,438Aug 17, 2024Updated last year
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-gβ¦β43,471Updated this week
- Foundational Models for State-of-the-Art Speech and Text Translationβ11,760Mar 3, 2026Updated last week
- A high-throughput and memory-efficient inference and serving engine for LLMsβ72,827Updated this week
- Zero-Shot Speech Editing and Text-to-Speech in the Wildβ8,466Mar 15, 2025Updated 11 months ago
- A natural language interface for computersβ62,652Feb 9, 2026Updated last month
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.β19,913Feb 11, 2026Updated last month