coqui-ai / xtts-streaming-server
★ 318 · Updated 8 months ago
Alternatives and similar repositories for xtts-streaming-server:
Users who are interested in xtts-streaming-server are comparing it to the libraries listed below.
- ★ 173 · Updated last year
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ★ 154 · Updated 8 months ago
- Open source inference code for Rev's model ★ 383 · Updated 2 weeks ago
- ★ 95 · Updated 10 months ago
- A simple FastAPI Server to run XTTSv2 ★ 482 · Updated 8 months ago
- Faster Tortoise inference than Tortoise Fast Fork ★ 128 · Updated 11 months ago
- Official Implementation of StyleTTS ★ 429 · Updated 2 months ago
- Running the F5-TTS by ONNX Runtime ★ 123 · Updated this week
- The code for the bark-voicecloning model. Training and inference. ★ 693 · Updated last year
- Interface for OuteTTS models. ★ 955 · Updated last month
- ★ 207 · Updated 5 months ago
- Local SRT/LLM/TTS Voicechat ★ 644 · Updated 5 months ago
- VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design ★ 544 · Updated last year
- [ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching ★ 927 · Updated last week
- FastAPI service on top of WhisperX ★ 72 · Updated this week
- Have a natural voice conversation with an LLM ★ 245 · Updated 3 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection ★ 632 · Updated 3 months ago
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion ★ 656 · Updated 2 months ago
- Whisper realtime streaming for long speech-to-text transcription and translation ★ 113 · Updated last year
- ★ 254 · Updated last year
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C… ★ 593 · Updated 7 months ago
- On-device streaming text-to-speech engine powered by deep learning ★ 73 · Updated this week
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis ★ 467 · Updated last week
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original! ★ 130 · Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback. ★ 112 · Updated last year
- TorToiSe fine-tuning with DLAS ★ 218 · Updated 7 months ago
- A Fast TTS Engine ★ 472 · Updated last month
- A lightweight end-to-end text-to-speech model ★ 110 · Updated 3 weeks ago
- Real-time Voice Activity Detection (VAD) with some example use cases like a simple voice bot and live transcription (realtime transcription) ★ 71 · Updated 9 months ago
- A ggml (C++) re-implementation of tortoise-tts ★ 178 · Updated 7 months ago