coqui-ai / TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
⭐ 43,627 · Updated last year
Alternatives and similar repositories for TTS
Users who are interested in TTS are comparing it to the libraries listed below.
- 🔊 Text-Prompted Generative Audio Model ⭐ 38,767 · Updated last year
- A multi-voice TTS system trained with an emphasis on quality ⭐ 14,715 · Updated last year
- Instant voice cloning by MIT and MyShell. Audio foundation model. ⭐ 35,521 · Updated 7 months ago
- Clone a voice in 5 seconds to generate arbitrary speech in real-time ⭐ 58,930 · Updated 2 months ago
- Robust Speech Recognition via Large-Scale Weak Supervision ⭐ 91,194 · Updated 2 months ago
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) ⭐ 10,065 · Updated 2 years ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/ ⭐ 7,972 · Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ⭐ 6,070 · Updated last year
- Faster Whisper transcription with CTranslate2 ⭐ 19,191 · Updated last week
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. ⭐ 6,990 · Updated 11 months ago
- Foundational Models for State-of-the-Art Speech and Text Translation ⭐ 11,713 · Updated last year
- A fast, local neural text to speech system ⭐ 10,284 · Updated 3 months ago
- Silero VAD: pre-trained enterprise-grade Voice Activity Detector ⭐ 7,450 · Updated last week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ⭐ 13,696 · Updated this week
- Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key ⭐ 9,410 · Updated 3 months ago
- Silero Models: pre-trained text-to-speech models made embarrassingly simple ⭐ 5,616 · Updated last week
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ⭐ 18,908 · Updated last month
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult… ⭐ 12,603 · Updated 5 months ago
- Zero-Shot Speech Editing and Text-to-Speech in the Wild ⭐ 8,442 · Updated 8 months ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech ⭐ 7,746 · Updated last year
- EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine ⭐ 8,383 · Updated last year
- Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten… ⭐ 12,389 · Updated last month
- Port of OpenAI's Whisper model in C/C++ ⭐ 44,820 · Updated last week
- An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena. ⭐ 39,287 · Updated 6 months ago
- An Open Source text-to-speech system built by inverting Whisper. ⭐ 4,533 · Updated 5 months ago
- Industry leading face manipulation platform ⭐ 25,979 · Updated this week
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy. ⭐ 2,540 · Updated last year
- SOTA Open Source TTS ⭐ 24,191 · Updated 3 weeks ago
- Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker… ⭐ 8,761 · Updated this week
- Inference and training library for high-quality TTS models. ⭐ 5,484 · Updated 11 months ago