πΈπ¬ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
β44,554Aug 16, 2024Updated last year
Alternatives and similar repositories for TTS
Users that are interested in TTS are comparing it to the libraries listed below
Sorting:
- π Text-Prompted Generative Audio Modelβ38,970Aug 19, 2024Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model.β35,938Apr 19, 2025Updated 10 months ago
- A multi-voice TTS system trained with an emphasis on qualityβ14,809Nov 19, 2024Updated last year
- Clone a voice in 5 seconds to generate arbitrary speech in real-timeβ59,349Dec 15, 2025Updated 2 months ago
- Robust Speech Recognition via Large-Scale Weak Supervisionβ94,628Dec 15, 2025Updated 2 months ago
- SOTA Open Source TTSβ24,906Feb 2, 2026Updated 2 weeks ago
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)β10,112Nov 9, 2023Updated 2 years ago
- Port of OpenAI's Whisper model in C/C++β46,720Feb 9, 2026Updated last week
- Faster Whisper transcription with CTranslate2β20,951Nov 19, 2025Updated 3 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressorβ¦β22,999Mar 13, 2025Updated 11 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsβ6,164Aug 10, 2024Updated last year
- The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.β103,139Feb 13, 2026Updated last week
- LLM inference in C/C++β95,169Updated this week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)β20,162Updated this week
- A generative speech model for daily dialogue.β38,714Jan 18, 2026Updated last month
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)β55,069Feb 9, 2026Updated last week
- The definitive Web UI for local AI, with powerful features and easy setup.β46,051Feb 3, 2026Updated 2 weeks ago
- Stable Diffusion web UIβ160,596Dec 18, 2025Updated 2 months ago
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.β162,576Updated this week
- π¦π The platform for reliable agents.β126,727Updated this week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"β14,079Updated this week
- EmotiVoice π: a Multi-Voice and Prompt-Controlled TTS Engineβ8,425Aug 13, 2024Updated last year
- GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.β77,126May 27, 2025Updated 8 months ago
- Amphion (/Γ¦mΛfaΙͺΙn/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junioβ¦β9,690May 27, 2025Updated 8 months ago
- Inference and training library for high-quality TTS models.β5,533Dec 10, 2024Updated last year
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.β7,189Dec 24, 2024Updated last year
- Build and share delightful machine learning apps, all in Python. π Star to support our work!β41,698Feb 13, 2026Updated last week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/β7,958Feb 11, 2024Updated 2 years ago
- A fast, local neural text to speech systemβ10,559Aug 26, 2025Updated 5 months ago
- LlamaIndex is the leading framework for building LLM-powered agents over your data.β46,977Feb 13, 2026Updated last week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.β39,409Jun 2, 2025Updated 8 months ago
- Industry leading face manipulation platformβ26,787Updated this week
- OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamicalβ¦β37,452Aug 17, 2024Updated last year
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-gβ¦β42,895Updated this week
- Foundational Models for State-of-the-Art Speech and Text Translationβ11,748Nov 14, 2024Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMsβ70,673Updated this week
- Zero-Shot Speech Editing and Text-to-Speech in the Wildβ8,461Mar 15, 2025Updated 11 months ago
- A natural language interface for computersβ62,135Feb 9, 2026Updated last week
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.β19,623Feb 11, 2026Updated last week