idiap / coqui-ai-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
★590, updated this week
Related projects
Alternatives and complementary repositories for coqui-ai-TTS
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv… (★1,120, updated this week)
- (★646, updated 2 weeks ago)
- A simple FastAPI Server to run XTTSv2 (★411, updated 3 months ago)
- Webui for using XTTS and for finetuning it (★653, updated last month)
- A nearly-live implementation of OpenAI's Whisper. (★2,060, updated 2 weeks ago)
- (★762, updated this week)
- Converts text to speech in realtime (★2,023, updated this week)
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models (★4,962, updated 3 months ago)
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C… (★518, updated 3 months ago)
- (★176, updated last month)
- Slightly improved official version for finetune xtts (★236, updated 3 weeks ago)
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech (★270, updated 9 months ago)
- Fast TorToiSe inference (5x or your money back!) (★789, updated 4 months ago)
- (★296, updated 4 months ago)
- Command Your World with Voice (★443, updated this week)
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning (★138, updated 4 months ago)
- TTS Generation Web UI (Bark, MusicGen + AudioGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, MAGNet, StyleTTS2, MMS, Stable Audio, Mars5,… (★1,826, updated this week)
- Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. Advanced audio processing. (★209, updated 5 months ago)
- Suno AI's Bark model in C/C++ for fast text-to-speech generation (★730, updated this week)
- (★1,094, updated 5 months ago)
- Controllable and fast Text-to-Speech for over 7000 languages! (★1,464, updated 2 weeks ago)
- Whisper realtime streaming for long speech-to-text transcription and translation (★2,092, updated this week)
- [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching (★742, updated last week)
- A webui for different audio related Neural Networks (★1,079, updated 3 months ago)
- first base model for full-duplex conversational audio (★1,560, updated last week)
- (★308, updated this week)
- A Gradio UI for XTTSv2 and RVC. (★144, updated 5 months ago)
- (★87, updated 6 months ago)
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine (★312, updated 2 months ago)
- Local SRT/LLM/TTS Voicechat (★544, updated last month)