SocAIty / Retrieval-based-Voice-Conversion-FastAPI
Adds a web API to RVC for running inference via JSON requests
☆21 Updated 8 months ago
Alternatives and similar repositories for Retrieval-based-Voice-Conversion-FastAPI:
Users who are interested in Retrieval-based-Voice-Conversion-FastAPI are comparing it to the libraries listed below.
- ☆46 Updated 4 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept) ☆66 Updated last year
- ☆59 Updated last week
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark ☆58 Updated last week
- ☆41 Updated 2 months ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place. ☆68 Updated 9 months ago
- ☆83 Updated 9 months ago
- ☆95 Updated 11 months ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui. ☆41 Updated last year
- Local11Labs allows generating high-quality text-to-speech and podcast content using the fast and tiny Kokoro-82M. ☆46 Updated 2 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆155 Updated 8 months ago
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use. ☆32 Updated 2 weeks ago
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0 ☆50 Updated 9 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆77 Updated 5 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on … ☆94 Updated this week
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh… ☆11 Updated last year
- Orpheus Chat WebUI ☆32 Updated this week
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time. ☆59 Updated 5 months ago
- Create text chunks which end at natural stopping points without using a tokenizer ☆26 Updated 2 weeks ago
- ☆25 Updated 11 months ago
- ☆54 Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆33 Updated 4 months ago
- 🎵 LyricWave - Your AI Music Composer 🎶 Compose Unique MP4 Songs Effortlessly! LyricWave uses AI to create personalized music by harmoni… ☆29 Updated 2 weeks ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G… ☆21 Updated this week
- Windows-compatible Fast API implementation of VoiceCraft, the Zero-Shot Speech Editing and Text-to-Speech in the Wild ☆21 Updated 11 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script) ☆52 Updated 5 months ago
- Yet Another (LLM) Web UI, made with Gemini ☆11 Updated 3 months ago
- Advanced RVC Inference for quicker and effortless model downloads ☆47 Updated last week
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s… ☆52 Updated 11 months ago
- Towards Robust Blind Face Restoration with Codebook Lookup Transformer ☆28 Updated last year