SocAIty / Retrieval-based-Voice-Conversion-FastAPILinks
Adds a web API to RVC to infer via json requests
ā26Updated 10 months ago
Alternatives and similar repositories for Retrieval-based-Voice-Conversion-FastAPI
Users that are interested in Retrieval-based-Voice-Conversion-FastAPI are comparing it to the libraries listed below
Sorting:
- š Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model barkā64Updated 2 months ago
- ā67Updated 2 months ago
- ā68Updated 7 months ago
- ā83Updated 11 months ago
- ā96Updated last year
- Collection of the best Applio plugins.ā29Updated 8 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes sā¦ā52Updated last year
- Audio datasets, easier.ā84Updated last year
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocosā46Updated 11 months ago
- fast state-of-the-art speech models and a runtime that runs anywhere š„ā55Updated 3 weeks ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on ā¦ā96Updated 3 weeks ago
- Simulates talk with an AI that can express emotionsā69Updated 10 months ago
- A random walk voice style cloning application for Kokoro text to speechā85Updated last week
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)ā26Updated this week
- Examples of using the llasa-tts models locallyā172Updated last month
- šļø Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets āØā38Updated 2 weeks ago
- ā50Updated 6 months ago
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAMā36Updated 2 weeks ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"ā79Updated 7 months ago
- ā43Updated 4 months ago
- Think of it as giving your AI a searchable diary and knowledge base that grows with every conversation.ā16Updated 3 weeks ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3ā62Updated 2 months ago
- Win & Liunux Gradio WebUI for CSM-1B model by sesameā44Updated 2 months ago
- An API for VoiceCraft.ā25Updated 11 months ago
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0ā53Updated 11 months ago
- ACE-Step: A Step Towards Music Generation Foundation Modelā40Updated 2 weeks ago
- Gradio UI for YuEā56Updated 2 months ago
- ā16Updated last year
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.ā61Updated 7 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Modelsā36Updated 3 weeks ago