idiap / coqui-ai-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆837 · Updated this week
Alternatives and similar repositories for coqui-ai-TTS:
Users who are interested in coqui-ai-TTS are comparing it to the libraries listed below.
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv… ☆1,320 · Updated last week
- Converts text to speech in realtime ☆2,287 · Updated this week
- Interface for OuteTTS models. ☆859 · Updated this week
- Controllable and fast Text-to-Speech for over 7000 languages! ☆1,519 · Updated 2 months ago
- ☆1,110 · Updated this week
- ☆1,106 · Updated 6 months ago
- Webui for using XTTS and for finetuning it ☆710 · Updated 3 months ago
- Suno AI's Bark model in C/C++ for fast text-to-speech generation ☆762 · Updated 2 months ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend. ☆622 · Updated 4 months ago
- [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching ☆825 · Updated 2 weeks ago
- ☆692 · Updated 2 months ago
- A simple FastAPI Server to run XTTSv2 ☆447 · Updated 5 months ago
- first base model for full-duplex conversational audio ☆1,669 · Updated last week
- An Open Source text-to-speech system built by inverting Whisper. ☆4,080 · Updated last month
- The code for the bark-voicecloning model. Training and inference. ☆681 · Updated last year
- TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5,… ☆1,947 · Updated last month
- Slightly improved official version for finetune xtts ☆289 · Updated 2 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆5,265 · Updated 5 months ago
- A simple, high-quality voice conversion tool focused on ease of use and performance. ☆1,984 · Updated this week
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ☆518 · Updated 3 weeks ago
- MARS5 speech model (TTS) from CAMB.AI ☆2,589 · Updated 5 months ago
- Command Your World with Voice ☆506 · Updated last month
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C… ☆555 · Updated 5 months ago
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech ☆298 · Updated last month
- Whisper realtime streaming for long speech-to-text transcription and translation ☆2,316 · Updated last week
- Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis ☆863 · Updated 5 months ago
- AI powered speech denoising and enhancement ☆1,581 · Updated last month
- A Fast TTS Engine ☆405 · Updated last week
- General Speech Restoration ☆1,067 · Updated 7 months ago
- A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri… ☆3,507 · Updated 2 weeks ago