X-T-E-R/GPT-SoVITS-Inference

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/X-T-E-R/GPT-SoVITS-Inference)

X-T-E-R / GPT-SoVITS-Inference

Inference Specialization

☆514

Alternatives and similar repositories for GPT-SoVITS-Inference

Users that are interested in GPT-SoVITS-Inference are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

X-T-E-R / Uni-TTS
View on GitHub
本项目意图在于让使用各类语音合成引擎的方式变得统一，支持多种语音合成引擎适配器，允许直接作为模组使用或启动后端服务
☆770Apr 15, 2024Updated 2 years ago
ben0oil1 / GPT-SoVITS-Server
View on GitHub
【脱离复杂的环境配置和整合包，极简配置推理服务】从GPT-SoVITS项目里面提取出来的，纯粹的推理服务方案。
☆325Apr 11, 2024Updated 2 years ago
Apauto-to-all / GPT-soVITS-Inference-batchTool
View on GitHub
这是一个批量推理工具，对同一段文字进行多次推理，并且支持随机参数，直到筛选出最满意的结果。
☆11Aug 19, 2024Updated last year
RVC-Boss / GPT-SoVITS
View on GitHub
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
☆59,977Jul 13, 2026Updated last week
KakaruHayate / ColorSplitter
View on GitHub
A cli tool for split vocal timbre.
☆293Jan 17, 2026Updated 6 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
2DIPW / gpt_sovits_infer_with_emotion
View on GitHub
基于中文文本情绪分析自动切换参考音频的 GPT-SoVITS 推理 Demo
☆108Mar 8, 2024Updated 2 years ago
Artrajz / vits-simple-api
View on GitHub
A simple VITS HTTP API, developed by extending Moegoe with additional features.
☆1,048May 18, 2026Updated 2 months ago
jianchang512 / gptsovits-api
View on GitHub
适用于 GPT-SoVITS 的api调用接口
☆345Mar 7, 2024Updated 2 years ago
High-Logic / Genie-TTS
View on GitHub
GPT-SoVITS ONNX Inference Engine & Model Converter
☆1,663Apr 18, 2026Updated 3 months ago
Alexw1111 / RefAudioEmoTagger
View on GitHub
一种基于Emotion2Vec的批量音频情感自动标注脚本
☆543Mar 7, 2025Updated last year
2DIPW / GPT-SoVITS-RefAudio-Tester
View on GitHub
GPT-SoVITS 参考音频推理效果批量试听
☆51Mar 8, 2024Updated 2 years ago
notiom / ER-nerf
View on GitHub
主要写er-nerf从零到一所有部署过程
☆44Aug 28, 2024Updated last year
lrxwisdom001 / GPT-SoVITS-Novels
View on GitHub
Make audio books in one click! Let Genshin characters read novels for you!
☆29Aug 2, 2024Updated last year
FunAudioLLM / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,292May 25, 2026Updated last month
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
fishaudio / Bert-VITS2
View on GitHub
vits2 backbone with multilingual-bert
☆8,776Jul 13, 2026Updated last week
yxlllc / vocal-remover
View on GitHub
Vocal Remover using Deep Neural Networks
☆21Dec 31, 2024Updated last year
X-D-Lab / GPT_SoVITS_Inference
View on GitHub
☆26Mar 13, 2024Updated 2 years ago
huahuahuage / Bert-VITS2-Speech
View on GitHub
Bert-VITS2 onnx推理版本
☆44Apr 24, 2024Updated 2 years ago
AliceNavigator / SpeakerClassifier
View on GitHub
A lightweight tool that efficiently isolates target speaker data from your datasets.
☆20Nov 23, 2024Updated last year
v3ucn / GPT-SoVITS-V2
View on GitHub
GPT-SoVITS-V2模型，合并了官方的一些PR，包含但不限于:参考音频自动填充，字幕同步，SillyTavern酒馆接入等功能
☆206Jan 15, 2025Updated last year
PriesiaMioShirakana / DragonianLib
View on GitHub
☆67Jul 26, 2025Updated 11 months ago
RVC-Project / Retrieval-based-Voice-Conversion-WebUI
View on GitHub
Easily train a good VC model with voice data <= 10 mins!
☆36,515Updated this week
natlamir / MeloTTS-Windows
View on GitHub
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
☆17Sep 18, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,332Jun 9, 2026Updated last month
AliceNavigator / GPT-SoVITS-Api-GUI
View on GitHub
GAG is a GUI for GPT-SoVITS inference. Just add it to the official integration package and run for a smoother experience.
☆245Jun 24, 2025Updated last year
2noise / ChatTTS
View on GitHub
A generative speech model for daily dialogue.
☆39,651Apr 10, 2026Updated 3 months ago
chenxiVFX / A-customizable-audiobook-generator-based-on-GPT-SoVITS-for-personalized-voice-tones.
View on GitHub
通过AI实现对话者的识别并进行文段分割，再接入语音合成，自动生成有声小说
☆29Apr 3, 2025Updated last year
huangxu1991 / GPT-SoVITS-VC
View on GitHub
VC Without Retrain!
☆130Apr 27, 2024Updated 2 years ago
innnky / MagVITS
View on GitHub
VITS with phoneme-level prosody modeling based on MaskGIT
☆85Aug 31, 2024Updated last year
FunAudioLLM / SenseVoice
View on GitHub
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,902Updated this week
TheStingerX / Ilaria-RVC-Mainline
View on GitHub
☆20Jun 7, 2024Updated 2 years ago
TMElyralab / MuseTalk
View on GitHub
MuseTalk: Real-Time High Quality Lip Synchorization with Latent Space Inpainting
☆6,199Sep 26, 2025Updated 9 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
TMElyralab / MuseV
View on GitHub
MuseV: Infinite-length and High Fidelity Virtual Human Video Generation with Visual Conditioned Parallel Denoising
☆2,843Jun 28, 2024Updated 2 years ago
AI-Hobbyist / Genshin_Datasets
View on GitHub
Genshin Datasets For SVC/SVS/TTS
☆735Jan 11, 2026Updated 6 months ago
v3ucn / live2d-TTS-LLM-GPT-SoVITS-Vtuber
View on GitHub
低成本的简单基于live2d TTS文字转语音和大模型聊天的直播解决方案
☆281Jul 4, 2024Updated 2 years ago
Ikaros-521 / AI-Vtuber
View on GitHub
AI Vtuber是一个由【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】驱动的虚拟主播【Live2D/UE/xuniren】，可以在【Bilibili/抖音/…
☆4,404Jul 29, 2025Updated 11 months ago
modelscope / FunASR
View on GitHub
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,364Updated this week
litagin02 / anime_speaker_embedding
View on GitHub
Speaker embedding for anime speech domain based on ECAPA_TDNN
☆21Jun 22, 2025Updated last year
lipku / LiveTalking
View on GitHub
Real time interactive streaming digital human
☆8,450Updated this week