ZaVang/GPT-SoVits

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZaVang/GPT-SoVits)

ZaVang / GPT-SoVits

重构GPT-SOVITS的项目，重写了部分代码，优化了webui的使用以及增加了api调用

☆29

Alternatives and similar repositories for GPT-SoVits

Users that are interested in GPT-SoVits are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Apauto-to-all / GPT-soVITS-Inference-batchTool
View on GitHub
这是一个批量推理工具，对同一段文字进行多次推理，并且支持随机参数，直到筛选出最满意的结果。
☆11Aug 19, 2024Updated last year
Harry-Yu-Shuhang / Step-Audio-tts
View on GitHub
☆11Feb 20, 2025Updated last year
MaxMax2016 / max-vc
View on GitHub
singing voice conversion without f0
☆23May 10, 2023Updated 3 years ago
ywh-my / Easy-Finetune-Bert-VITS2
View on GitHub
Bert-VITS2项目bug多且教程不友好。本proj尽可能修复了Bert-vits2项目的bug，并且可一键启动训练。仅需50条目标说话人语音，获得稳定、快速的TTS模型。
☆69Aug 19, 2025Updated 11 months ago
LAION-AI / emotional-speech-annotations
View on GitHub
This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models
☆35Oct 13, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Zolewit / TNNdemo
View on GitHub
很好用的tnn classify demo
☆11Mar 24, 2021Updated 5 years ago
abinggo / genefacepp
View on GitHub
记录学习geneface++所遇到的各种问题
☆12Aug 5, 2024Updated last year
litagin02 / laughter-collector
View on GitHub
大量の音声データから笑い声部分を集めるやつ
☆14May 23, 2024Updated 2 years ago
adelacvg / detail_tts
View on GitHub
All generative model in one for better TTS model
☆74Sep 8, 2024Updated last year
xkx-hub / ISCSLP2024_CoVoC_baseline
View on GitHub
☆13Jun 8, 2024Updated 2 years ago
huahuahuage / Bert-VITS2-Speech
View on GitHub
Bert-VITS2 onnx推理版本
☆44Apr 24, 2024Updated 2 years ago
zhai-lw / L3AC
View on GitHub
A lightweight audio codec based on a single quantizer
☆35Sep 4, 2025Updated 10 months ago
xingchensong / CosyVoice-ttsfrd
View on GitHub
☆25Jun 19, 2025Updated last year
BaronVladziu / ESOLA-Implementation
View on GitHub
My implementation of Epoch-Synchronous Overlap-Add method for time stretching and pitch shifting.
☆10Jan 25, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
2DIPW / GPT-SoVITS-RefAudio-Tester
View on GitHub
GPT-SoVITS 参考音频推理效果批量试听
☆51Mar 8, 2024Updated 2 years ago
megaease / easevoice-trainer-portal
View on GitHub
EaseVoice Trainer is a simple and user-friendly voice cloning and speech model trainer.
☆15Apr 27, 2025Updated last year
jdh-algo / JoyTTS
View on GitHub
☆41Jul 15, 2025Updated last year
yrom / finetune-index-tts
View on GitHub
IndexTTS Fine-tuning notebooks
☆138Jun 17, 2025Updated last year
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
OlliV / DiaPro
View on GitHub
Dialog/Vocal Processor VST
☆12May 26, 2025Updated last year
AndrewBarker12345 / 3DAudio
View on GitHub
An audio effects plugin that simulates moving surround sound audio over headphones.
☆12Nov 11, 2020Updated 5 years ago
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
aisegmentcn / human-matting-sdk
View on GitHub
人像分割SDK（支持图片和视频），支持Windows, Android, iOS。human segmentation matting
☆25Aug 6, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
HaydonCardew / StereoToAmbi
View on GitHub
☆15Jan 4, 2022Updated 4 years ago
facebookresearch / Implicit-HRTF
View on GitHub
This repository contains the dataset used to train the neural network model descried in the paper "Implicit HRTF Modeling Using Tempora…
☆11Aug 4, 2023Updated 2 years ago
3DTune-In / 3dti_AudioToolkit_VST_Plugins
View on GitHub
VST plugin containing the 3D Tune-In Toolkit
☆11Mar 31, 2022Updated 4 years ago
jingzhunxue / flow_mirror
View on GitHub
flow mirror models from JZX AI Labs
☆43Sep 30, 2024Updated last year
apple-yinhan / Noise-robust-SED
View on GitHub
☆14Jan 2, 2025Updated last year
gdean725706 / AudioManipulator
View on GitHub
A real-time audio processing application for standalone and mobile devices
☆12Sep 10, 2018Updated 7 years ago
yl4579 / DMOSpeech2
View on GitHub
☆302Jul 22, 2025Updated last year
ZhouHuang23 / SBA-Net
View on GitHub
Dataset, code and results repository for SBA-Net.
☆14Sep 23, 2022Updated 3 years ago
RonLevie / LTFT-Phase-Vocoder
View on GitHub
LTFT-Phase-Vocoder is an audio effect that slows down an audio signal without dilating its frequency content or pitch.
☆16Dec 19, 2020Updated 5 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
lukemcraig / PathSynth
View on GitHub
A simple synthesizer where the oscillator is determined by a user-defined path.
☆18Nov 24, 2019Updated 6 years ago
PhonemeHallucinator / Phoneme_Hallucinator
View on GitHub
☆48Aug 16, 2023Updated 2 years ago
mockvox / mockvox
View on GitHub
This project aims to build a community-driven voice synthesis & cloning platform.
☆30Nov 16, 2025Updated 8 months ago
2DIPW / dub_genius
View on GitHub
基于GPT-SoVITS的视频剪辑快捷配音工具
☆176Mar 15, 2024Updated 2 years ago
Splapier / JohnAI
View on GitHub
Bringing together LLM, TTS, STT and other necessary tools in one project for easier custom AI Vtuber creation and usage
☆22Dec 10, 2025Updated 7 months ago
jingzhunxue / FlowMirror_HydraVox
View on GitHub
FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…
☆49Feb 17, 2026Updated 5 months ago
thuhcsi / NeuCoSVC
View on GitHub
☆299May 22, 2024Updated 2 years ago