coqui-ai / TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
★44,391 · Updated last year
Alternatives and similar repositories for TTS
Users who are interested in TTS are comparing it to the libraries listed below.
- Build a basic version of PayTM ★17 · Updated 7 months ago
- game test ★16 · Updated 8 months ago
- I practice C and C++ ★17 · Updated last week
- Text-Prompted Generative Audio Model ★38,938 · Updated last year
- A multi-voice TTS system trained with an emphasis on quality ★14,794 · Updated last year
- Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts) ★10,107 · Updated 2 years ago
- Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio… ★9,670 · Updated 8 months ago
- VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech ★7,809 · Updated 2 years ago
- [SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild ★7,209 · Updated last year
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/ ★7,960 · Updated last year
- Robust Speech Recognition via Large-Scale Weak Supervision ★93,999 · Updated last month
- Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text fronten… ★12,503 · Updated this week
- This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult… ★12,805 · Updated 7 months ago
- Port of OpenAI's Whisper model in C/C++ ★46,315 · Updated this week
- EmotiVoice: a Multi-Voice and Prompt-Controlled TTS Engine ★8,417 · Updated last year
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization) ★19,802 · Updated 3 months ago
- JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU. ★4,674 · Updated last year
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy. ★2,555 · Updated last year
- A natural language interface for computers ★61,933 · Updated last month
- Text-prompted Generative Audio Model - With the ability to clone voices ★3,339 · Updated 5 months ago
- SOTA Open Source TTS ★24,723 · Updated 3 weeks ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ★6,144 · Updated last year
- Build and share delightful machine learning apps, all in Python. Star to support our work! ★41,496 · Updated this week
- Foundational Models for State-of-the-Art Speech and Text Translation ★11,739 · Updated last year
- A fast, local neural text to speech system ★10,497 · Updated 5 months ago
- Instant voice cloning by MIT and MyShell. Audio foundation model. ★35,856 · Updated 9 months ago
- Faster Whisper transcription with CTranslate2 ★20,707 · Updated 2 months ago
- An Open Source text-to-speech system built by inverting Whisper. ★4,551 · Updated last month
- High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean. ★7,160 · Updated last year
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-g… ★42,470 · Updated this week