RVC-Project/Retrieval-based-Voice-Conversion-WebUI

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RVC-Project/Retrieval-based-Voice-Conversion-WebUI)

RVC-Project / Retrieval-based-Voice-Conversion-WebUI

Easily train a good VC model with voice data <= 10 mins!

☆36,438

Alternatives and similar repositories for Retrieval-based-Voice-Conversion-WebUI

Users that are interested in Retrieval-based-Voice-Conversion-WebUI are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

w-okada / voice-changer
View on GitHub
リアルタイムボイスチェンジャー Realtime Voice Changer
☆20,619Mar 21, 2026Updated 3 months ago
RVC-Boss / GPT-SoVITS
View on GitHub
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
☆59,806Updated this week
svc-develop-team / so-vits-svc
View on GitHub
SoftVC VITS Singing Voice Conversion
☆28,146Nov 11, 2023Updated 2 years ago
Anjok07 / ultimatevocalremovergui
View on GitHub
GUI for a Vocal Remover that uses Deep Neural Networks.
☆25,398Mar 13, 2025Updated last year
voicepaw / so-vits-svc-fork
View on GitHub
so-vits-svc fork with realtime support, improved interface and more features.
☆9,327Updated this week
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
AUTOMATIC1111 / stable-diffusion-webui
View on GitHub
Stable Diffusion web UI
☆164,252Mar 2, 2026Updated 4 months ago
fishaudio / Bert-VITS2
View on GitHub
vits2 backbone with multilingual-bert
☆8,773Updated this week
suno-ai / bark
View on GitHub
🔊 Text-Prompted Generative Audio Model
☆39,197Aug 19, 2024Updated last year
yxlllc / DDSP-SVC
View on GitHub
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
☆2,625Feb 22, 2026Updated 4 months ago
coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,756Aug 16, 2024Updated last year
Comfy-Org / ComfyUI
View on GitHub
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
☆120,828Updated this week
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,273Jun 9, 2026Updated last month
facefusion / facefusion
View on GitHub
Industry leading face manipulation platform
☆29,288Updated this week
PlayVoice / whisper-vits-svc
View on GitHub
Core Engine of Singing Voice Conversion & Singing Voice Clone
☆2,861Apr 23, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
jaywalnut310 / vits
View on GitHub
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
☆7,882Dec 6, 2023Updated 2 years ago
Plachtaa / VITS-fast-fine-tuning
View on GitHub
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
☆5,019Jan 21, 2025Updated last year
myshell-ai / OpenVoice
View on GitHub
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆36,943Apr 19, 2025Updated last year
2noise / ChatTTS
View on GitHub
A generative speech model for daily dialogue.
☆39,619Apr 10, 2026Updated 3 months ago
iperov / DeepFaceLive
View on GitHub
Real-time face swap for PC streaming or video calls
☆30,993Nov 8, 2024Updated last year
OpenTalker / SadTalker
View on GitHub
[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
☆13,952Jun 26, 2024Updated 2 years ago
FunAudioLLM / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,188May 25, 2026Updated last month
facebookresearch / audiocraft
View on GitHub
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…
☆23,476Mar 3, 2026Updated 4 months ago
Mangio621 / Mangio-RVC-Fork
View on GitHub
*CREPE+HYBRID TRAINING* A very experimental fork of the Retrieval-based-Voice-Conversion-WebUI repo that incorporates a variety of other …
☆1,228Sep 27, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
babysor / MockingBird
View on GitHub
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆36,922Mar 3, 2026Updated 4 months ago
oobabooga / textgen
View on GitHub
Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
☆47,450Jun 2, 2026Updated last month
Mikubill / sd-webui-controlnet
View on GitHub
WebUI extension for ControlNet
☆17,847Aug 12, 2024Updated last year
IAHispano / Applio
View on GitHub
A simple, high-quality voice conversion tool focused on ease of use and performance.
☆3,481Updated this week
openai / whisper
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆104,995Apr 15, 2026Updated 3 months ago
SWivid / F5-TTS
View on GitHub
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆14,952Jul 5, 2026Updated last week
guoyww / AnimateDiff
View on GitHub
Official implementation of AnimateDiff.
☆12,187Jul 31, 2024Updated last year
Plachtaa / VALL-E-X
View on GitHub
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
☆7,937Feb 11, 2024Updated 2 years ago
CorentinJ / Real-Time-Voice-Cloning
View on GitHub
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆60,028Mar 9, 2026Updated 4 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Rudrabha / Wav2Lip
View on GitHub
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…
☆13,096Jun 22, 2025Updated last year
Plachtaa / seed-vc
View on GitHub
zero-shot voice conversion & singing voice conversion, with real-time support
☆3,870Apr 20, 2025Updated last year
neonbjb / tortoise-tts
View on GitHub
A multi-voice TTS system trained with an emphasis on quality
☆14,860Nov 19, 2024Updated last year
OpenTalker / video-retalking
View on GitHub
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
☆7,265Aug 5, 2024Updated last year
open-mmlab / Amphion
View on GitHub
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…
☆9,931Mar 25, 2026Updated 3 months ago
MoonInTheRiver / DiffSinger
View on GitHub
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
☆4,827Mar 19, 2025Updated last year
prophesier / diff-svc
View on GitHub
Singing Voice Conversion via diffusion model
☆2,713Jun 6, 2026Updated last month