coqui-ai / xtts-streaming-server
ā323Updated 9 months ago
Alternatives and similar repositories for xtts-streaming-server:
Users that are interested in xtts-streaming-server are comparing it to the libraries listed below
- ā173Updated last year
- ā216Updated 3 weeks ago
- š š¤ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningā159Updated 8 months ago
- ā95Updated 11 months ago
- Have a natural voice conversation with an LLMā246Updated 4 months ago
- The code for the bark-voicecloning model. Training and inference.ā694Updated last year
- Official Implementation of StyleTTSā429Updated 3 months ago
- VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Designā551Updated last year
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesisā503Updated this week
- Open source inference code for Rev's modelā395Updated last month
- A simple FastAPI Server to run XTTSv2ā495Updated 8 months ago
- Faster Tortoise inference then Tortoise Fast Forkā128Updated 11 months ago
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!ā133Updated last year
- Running the F5-TTS by ONNX Runtimeā142Updated last week
- ā129Updated 4 months ago
- Local SRT/LLM/TTS Voicechatā658Updated 6 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionā667Updated 3 months ago
- ā254Updated last year
- TorToiSe fine-tuning with DLASā218Updated 8 months ago
- [ICASSP 2024] šµ Matcha-TTS: A fast TTS architecture with conditional flow matchingā964Updated last week
- Whisper realtime streaming for long speech-to-text transcription and translationā113Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes sā¦ā52Updated 11 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3ā400Updated 7 months ago
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)ā76Updated 10 months ago
- ā354Updated 7 months ago
- unofficial vits2-TTS implementation in pytorchā516Updated last year
- Open models for Coqui STTā136Updated last year
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversionā660Updated 2 months ago
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Cā¦ā606Updated 8 months ago
- G2Pā202Updated last week