SocAIty / Retrieval-based-Voice-Conversion-FastAPILinks
Adds a web API to RVC to infer via json requests
ā27Updated last year
Alternatives and similar repositories for Retrieval-based-Voice-Conversion-FastAPI
Users that are interested in Retrieval-based-Voice-Conversion-FastAPI are comparing it to the libraries listed below
Sorting:
- š š¤ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningā157Updated last year
- ā100Updated last year
- A random walk voice style cloning application for Kokoro text to speechā130Updated 3 months ago
- Streaming and Fine-tuning for Chatterbox TTSā182Updated 3 months ago
- ā83Updated last year
- š Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model barkā67Updated 2 months ago
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.ā51Updated 6 months ago
- Examples of using the llasa-tts models locallyā180Updated 5 months ago
- API server for Instant voice cloning by MyShell.ā103Updated 11 months ago
- ā67Updated 6 months ago
- Quantized text-audio foundation model from Boson AIā36Updated last month
- SoTA open-source TTSā92Updated this week
- Simulates talk with an AI that can express emotionsā78Updated 3 months ago
- ā51Updated 10 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3ā64Updated 6 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)ā72Updated 2 years ago
- ā71Updated last month
- TTS pipeline that uses RVC to enhance audio quality and cloningā144Updated last year
- fast state-of-the-art speech models and a runtime that runs anywhere š„ā56Updated 3 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"ā81Updated 11 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)ā29Updated 3 months ago
- XTTSv2 Extension for oobabooga text-generation-webuiā155Updated last year
- A Gradio UI for XTTSv2 and RVC.ā159Updated last year
- ā91Updated 4 months ago
- Cog wrapper for Coqui / xtts-v2ā78Updated 9 months ago
- Fine Tune the Style-TTS2 Voice Modelā252Updated 3 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech Gā¦ā25Updated 5 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on ā¦ā101Updated 3 weeks ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)ā52Updated 11 months ago
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.ā49Updated 8 months ago