SocAIty / Retrieval-based-Voice-Conversion-FastAPILinks
Adds a web API to RVC to infer via json requests
β29Updated last year
Alternatives and similar repositories for Retrieval-based-Voice-Conversion-FastAPI
Users that are interested in Retrieval-based-Voice-Conversion-FastAPI are comparing it to the libraries listed below
Sorting:
- A random walk voice style cloning application for Kokoro text to speechβ174Updated 5 months ago
- π π€ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningβ160Updated last year
- β71Updated 8 months ago
- Quantized text-audio foundation model from Boson AIβ41Updated 3 months ago
- β50Updated last year
- β100Updated last year
- fast state-of-the-art speech models and a runtime that runs anywhere π₯β57Updated 5 months ago
- β46Updated 10 months ago
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.β52Updated 8 months ago
- API server for Instant voice cloning by MyShell.β105Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"β81Updated last year
- β72Updated 4 months ago
- π Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model barkβ69Updated 5 months ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui.β42Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)β71Updated 2 years ago
- β83Updated last year
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)β52Updated last year
- XTTSv2 Extension for oobabooga text-generation-webuiβ155Updated 2 years ago
- SoTA open-source TTSβ123Updated last month
- TTS pipeline that uses RVC to enhance audio quality and cloningβ146Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech Gβ¦β25Updated 8 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ98Updated last year
- Examples of using the llasa-tts models locallyβ181Updated 7 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3β64Updated 8 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking aroundβ¦β54Updated last year
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.β70Updated last year
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M.β50Updated 10 months ago
- A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!β53Updated last week
- Canopyai Orpheus & LMStudio: 100% Uncensored Private Offline chatβ25Updated 7 months ago
- Writing Extension for Text Generation WebUIβ64Updated 3 months ago