coqui-ai / xtts-streaming-server
ā326Updated 10 months ago
Alternatives and similar repositories for xtts-streaming-server:
Users that are interested in xtts-streaming-server are comparing it to the libraries listed below
- ā174Updated last year
- ā223Updated last month
- š š¤ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningā159Updated 9 months ago
- ā96Updated last year
- Official Implementation of StyleTTSā431Updated 3 months ago
- ā255Updated last year
- VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Designā560Updated last year
- Running the F5-TTS by ONNX Runtimeā148Updated last week
- Faster Tortoise inference then Tortoise Fast Forkā128Updated last year
- A simple FastAPI Server to run XTTSv2ā504Updated 9 months ago
- The code for the bark-voicecloning model. Training and inference.ā696Updated last year
- G2Pā227Updated this week
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversionā661Updated 3 months ago
- Open source inference code for Rev's modelā402Updated last week
- [ICASSP 2024] šµ Matcha-TTS: A fast TTS architecture with conditional flow matchingā986Updated last month
- Local SRT/LLM/TTS Voicechatā667Updated 6 months ago
- Whisper realtime streaming for long speech-to-text transcription and translationā114Updated last year
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with Cā¦ā628Updated 8 months ago
- TorToiSe fine-tuning with DLASā220Updated 9 months ago
- Python bindings for whisper.cppā246Updated 2 weeks ago
- unofficial vits2-TTS implementation in pytorchā518Updated last year
- The reproduced code for Google's SoundStormā265Updated last year
- Have a natural voice conversation with an LLMā248Updated 4 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3ā405Updated 7 months ago
- Interface for OuteTTS models.ā1,205Updated this week
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes sā¦ā52Updated last year
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!ā134Updated last year
- Efficient approach to speaker diarization using voice characteristics extractionā92Updated last year
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detectionā686Updated 4 months ago
- Open models for Coqui STTā138Updated last year