voicepaw/so-vits-svc-fork

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/voicepaw/so-vits-svc-fork)

voicepaw / so-vits-svc-fork

so-vits-svc fork with realtime support, improved interface and more features.

☆9,329

Alternatives and similar repositories for so-vits-svc-fork

Users that are interested in so-vits-svc-fork are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

svc-develop-team / so-vits-svc
View on GitHub
SoftVC VITS Singing Voice Conversion
☆28,152Nov 11, 2023Updated 2 years ago
w-okada / voice-changer
View on GitHub
リアルタイムボイスチェンジャー Realtime Voice Changer
☆20,720Mar 21, 2026Updated 4 months ago
PlayVoice / whisper-vits-svc
View on GitHub
Core Engine of Singing Voice Conversion & Singing Voice Clone
☆2,864Apr 23, 2024Updated 2 years ago
RVC-Project / Retrieval-based-Voice-Conversion-WebUI
View on GitHub
Easily train a good VC model with voice data <= 10 mins!
☆36,902Jul 23, 2026Updated last week
Anjok07 / ultimatevocalremovergui
View on GitHub
GUI for a Vocal Remover that uses Deep Neural Networks.
☆25,633Mar 13, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
yxlllc / DDSP-SVC
View on GitHub
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
☆2,643Updated this week
PriesiaMioShirakana / DragonianVoice
View on GitHub
多个SVC/TTS的C++推理库
☆1,128May 18, 2025Updated last year
suno-ai / bark
View on GitHub
🔊 Text-Prompted Generative Audio Model
☆39,218Aug 19, 2024Updated last year
jaywalnut310 / vits
View on GitHub
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
☆7,888Dec 6, 2023Updated 2 years ago
flutydeer / audio-slicer
View on GitHub
A simple GUI application that slices audio with silence detection
☆1,459Apr 5, 2026Updated 3 months ago
Plachtaa / VITS-fast-fine-tuning
View on GitHub
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
☆5,017Jan 21, 2025Updated last year
prophesier / diff-svc
View on GitHub
Singing Voice Conversion via diffusion model
☆2,716Jun 6, 2026Updated last month
coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,851Aug 16, 2024Updated last year
RVC-Boss / GPT-SoVITS
View on GitHub
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
☆60,321Jul 22, 2026Updated last week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
facebookresearch / audiocraft
View on GitHub
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…
☆23,532Mar 3, 2026Updated 4 months ago
OpenTalker / SadTalker
View on GitHub
[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
☆13,994Jun 26, 2024Updated 2 years ago
AUTOMATIC1111 / stable-diffusion-webui
View on GitHub
Stable Diffusion web UI
☆164,354Mar 2, 2026Updated 5 months ago
fishaudio / Bert-VITS2
View on GitHub
vits2 backbone with multilingual-bert
☆8,786Updated this week
neonbjb / tortoise-tts
View on GitHub
A multi-voice TTS system trained with an emphasis on quality
☆14,866Nov 19, 2024Updated last year
Plachtaa / VALL-E-X
View on GitHub
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
☆7,936Feb 11, 2024Updated 2 years ago
Mikubill / sd-webui-controlnet
View on GitHub
WebUI extension for ControlNet
☆17,852Aug 12, 2024Updated last year
MoonInTheRiver / DiffSinger
View on GitHub
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
☆4,835Jul 24, 2026Updated last week
PlayVoice / vits_chinese
View on GitHub
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
☆1,229Feb 5, 2024Updated 2 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
Rudrabha / Wav2Lip
View on GitHub
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…
☆13,141Jun 22, 2025Updated last year
PlayVoice / lora-svc
View on GitHub
singing voice change based on whisper, and lora for singing voice clone
☆646Nov 3, 2023Updated 2 years ago
oobabooga / textgen
View on GitHub
Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
☆47,514Jun 2, 2026Updated 2 months ago
AIGC-Audio / AudioGPT
View on GitHub
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
☆10,169Jul 6, 2024Updated 2 years ago
myshell-ai / OpenVoice
View on GitHub
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆37,070Apr 19, 2025Updated last year
fishaudio / fish-diffusion
View on GitHub
An easy to understand TTS / SVS / SVC framework
☆750Jun 1, 2026Updated 2 months ago
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,927Jul 26, 2026Updated last week
XingangPan / DragGAN
View on GitHub
Official Code for DragGAN (SIGGRAPH 2023)
☆35,785May 18, 2024Updated 2 years ago
CorentinJ / Real-Time-Voice-Cloning
View on GitHub
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆60,070Mar 9, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
OpenTalker / video-retalking
View on GitHub
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
☆7,269Aug 5, 2024Updated last year
henrymaas / AudioSlicer
View on GitHub
Audio Slicer that uses silence detection to split .wav audio files into multiple .wav samples.
☆302May 8, 2024Updated 2 years ago
iperov / DeepFaceLive
View on GitHub
Real-time face swap for PC streaming or video calls
☆31,011Nov 8, 2024Updated last year
lllyasviel / ControlNet
View on GitHub
Let us control diffusion models!
☆34,029Feb 25, 2024Updated 2 years ago
facefusion / facefusion
View on GitHub
Industry leading face manipulation platform
☆29,488Updated this week
IAHispano / Applio
View on GitHub
A simple, high-quality voice conversion tool focused on ease of use and performance.
☆3,543Updated this week
babysor / MockingBird
View on GitHub
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆36,915Mar 3, 2026Updated 4 months ago