coqui-ai / xtts-streaming-serverLinks
ā343Updated last year
Alternatives and similar repositories for xtts-streaming-server
Users that are interested in xtts-streaming-server are comparing it to the libraries listed below
Sorting:
- ā175Updated last year
- š š¤ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningā159Updated last year
- Running the F5-TTS by ONNX Runtimeā178Updated 3 weeks ago
- ā251Updated 2 months ago
- The code for the bark-voicecloning model. Training and inference.ā705Updated 2 years ago
- ā99Updated last year
- Official Implementation of StyleTTSā446Updated 7 months ago
- Open source inference code for Rev's modelā428Updated 4 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"ā65Updated 9 months ago
- Faster Tortoise inference then Tortoise Fast Forkā128Updated last year
- ā262Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translationā121Updated last year
- VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Designā601Updated 2 years ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesisā611Updated 5 months ago
- š Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. š§š„š Advanced audio processing.ā251Updated last year
- Implementation of Meta-Voicebox : The first generative AI model for speech to generalize across tasks with state-of-the-art performance.ā585Updated 2 years ago
- TorToiSe fine-tuning with DLASā224Updated last year
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversionā681Updated 7 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes sā¦ā52Updated last year
- Efficient approach to speaker diarization using voice characteristics extractionā99Updated 2 months ago
- The API server version of the SadTalker project. Runs in Docker, 10 times faster than the original!ā140Updated 2 years ago
- G2Pā316Updated last month
- Open models for Coqui STTā141Updated 2 years ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.ā68Updated 2 months ago
- A simple FastAPI Server to run XTTSv2ā537Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesā97Updated last year
- Live-Transcription (STT) with Whisper PoCā194Updated last year
- šø - A general purpose model trainer, as flexible as it getsā223Updated last year
- Local SRT/LLM/TTS Voicechatā716Updated 11 months ago
- FastAPI service on top of WhisperXā128Updated last week