huangxu1991/GPT-SoVITS-VC

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/huangxu1991/GPT-SoVITS-VC)

huangxu1991 / GPT-SoVITS-VC

VC Without Retrain!

☆130

Alternatives and similar repositories for GPT-SoVITS-VC

Users that are interested in GPT-SoVITS-VC are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

uthree / fastersvc
View on GitHub
☆26Mar 20, 2024Updated 2 years ago
innnky / ar-vits
View on GitHub
text to speech using autoregressive transformer and VITS
☆248Apr 3, 2024Updated 2 years ago
idiap / knn-tts
View on GitHub
Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model
☆36Apr 29, 2025Updated last year
adelacvg / NS2VC
View on GitHub
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech
☆236Feb 29, 2024Updated 2 years ago
v3ucn / Fix-Loudness
View on GitHub
音频响度统一，音量归一化处理
☆13May 3, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
uthree / tinyvc
View on GitHub
a lightweight voice conversion
☆87Feb 25, 2026Updated 4 months ago
thuhcsi / NeuCoSVC
View on GitHub
☆299May 22, 2024Updated 2 years ago
MaxMax2016 / Grad-TTS-Chinese
View on GitHub
Huawei Grad-TTS for Chinese
☆50Sep 26, 2023Updated 2 years ago
cnaigithub / Auto_Tuning_Zeroshot_TTS_and_VC
View on GitHub
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…
☆80May 29, 2023Updated 3 years ago
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
adelacvg / diff-vits
View on GitHub
☆39Oct 1, 2023Updated 2 years ago
tarepan / SpeechMOS
View on GitHub
Easy-to-Use Speech MOS predictors
☆360Oct 24, 2023Updated 2 years ago
FireRedTeam / FireRedTTS
View on GitHub
An Open-Sourced LLM-empowered Foundation TTS System
☆908Sep 28, 2025Updated 9 months ago
yxlllc / ReFlow-VAE-SVC
View on GitHub
☆158Feb 6, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
hrnoh24 / stream-vc
View on GitHub
An unofficial PyTorch implementation of the StreamVC(Real-Time Low-Latency Voice Conversion)
☆129Jun 11, 2026Updated last month
KevinWang676 / GPT-SoVITS-emo
View on GitHub
☆50May 1, 2024Updated 2 years ago
zhenye234 / FlashSpeech
View on GitHub
ACM MM 2024 FlashSpeech: Efficient Zero-Shot Speech Synthesis
☆155Sep 20, 2024Updated last year
caizexin / GenVC
View on GitHub
Self-supervised Generative LM-based Voice Conversion
☆58Apr 24, 2025Updated last year
foxyear-kyumin / lip_mask
View on GitHub
通过此代码可以免训练模型并通过轻量级服务器定制数字人形象
☆105Mar 27, 2024Updated 2 years ago
EZ-VC / EZ-VC
View on GitHub
[EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion
☆43Sep 9, 2025Updated 10 months ago
ASLP-lab / MeanVC
View on GitHub
A Lightweight and Streaming Zero-Shot Voice Conversion via Mean Flows
☆297Jan 8, 2026Updated 6 months ago
qiuqiao / DDSP-HiFiGAN
View on GitHub
基于PC-DDSP和nsf-HiFiGAN的声码器
☆19Jul 17, 2023Updated 3 years ago
quickvc / QuickVC-VoiceConversion
View on GitHub
QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion
☆261Jul 13, 2023Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
AliceNavigator / auto-VITS-DataLabeling
View on GitHub
Simple data labeling script with funasr inside. 使用阿里fanasr进行VITS训练数据标注
☆80Oct 10, 2023Updated 2 years ago
xincanfeng / vitsGPT
View on GitHub
☆60Jun 28, 2024Updated 2 years ago
liuhuang31 / g2pw_once
View on GitHub
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14Dec 30, 2023Updated 2 years ago
KdaiP / StableTTS
View on GitHub
Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3
☆438Sep 13, 2024Updated last year
k2-fsa / ZipVoice
View on GitHub
Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching
☆1,016Dec 2, 2025Updated 7 months ago
quqixun / gpupixel_pywrapper
View on GitHub
A simple python wrapper for gpupixel using SourceRawDataInput and TargetRawDataOutput.
☆11Aug 14, 2024Updated last year
ex3ndr / supervoice-voicebox
View on GitHub
VoiceBox neural network implementation
☆110Aug 2, 2024Updated last year
hayeong0 / DDDM-VC
View on GitHub
Official Pytorch Implementation for "DDDM-VC: Decoupled Denoising Diffusion Models with Disentangled Representation and Prior Mixup for V…
☆244Jul 31, 2024Updated last year
ShawnPi233 / HQ-SVC
View on GitHub
Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios"(AAAI 2026)
☆108Jun 17, 2026Updated last month
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
tuanh123789 / Spark-TTS-finetune
View on GitHub
finetune llm part for spark-tts model
☆125Mar 25, 2025Updated last year
wujinzhong / Wav2Lip_TensorRT
View on GitHub
☆29Oct 1, 2023Updated 2 years ago
parrot-tts / Parrot-TTS
View on GitHub
Official Code for ParrotTTS
☆58Oct 13, 2024Updated last year
hcy71o / SNAC
View on GitHub
Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…
☆57Aug 7, 2023Updated 2 years ago
p0p4k / vits2_pytorch
View on GitHub
unofficial vits2-TTS implementation in pytorch
☆548Mar 28, 2024Updated 2 years ago
adelacvg / detail_tts
View on GitHub
All generative model in one for better TTS model
☆74Sep 8, 2024Updated last year
innnky / MagVITS
View on GitHub
VITS with phoneme-level prosody modeling based on MaskGIT
☆85Aug 31, 2024Updated last year