RVC-Boss/GPT-SoVITS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RVC-Boss/GPT-SoVITS)

RVC-Boss / GPT-SoVITS

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

☆60,135

Alternatives and similar repositories for GPT-SoVITS

Users that are interested in GPT-SoVITS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

2noise / ChatTTS
View on GitHub
A generative speech model for daily dialogue.
☆39,683Apr 10, 2026Updated 3 months ago
QwenAudio / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,418May 25, 2026Updated 2 months ago
RVC-Project / Retrieval-based-Voice-Conversion-WebUI
View on GitHub
Easily train a good VC model with voice data <= 10 mins!
☆36,730Updated this week
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,382Updated this week
fishaudio / Bert-VITS2
View on GitHub
vits2 backbone with multilingual-bert
☆8,781Jul 20, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
svc-develop-team / so-vits-svc
View on GitHub
SoftVC VITS Singing Voice Conversion
☆28,153Nov 11, 2023Updated 2 years ago
myshell-ai / OpenVoice
View on GitHub
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆37,026Apr 19, 2025Updated last year
Comfy-Org / ComfyUI
View on GitHub
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
☆122,349Updated this week
AUTOMATIC1111 / stable-diffusion-webui
View on GitHub
Stable Diffusion web UI
☆164,277Mar 2, 2026Updated 4 months ago
index-tts / index-tts
View on GitHub
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
☆22,170Jul 14, 2026Updated last week
coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,820Aug 16, 2024Updated last year
SWivid / F5-TTS
View on GitHub
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆15,017Updated this week
facefusion / facefusion
View on GitHub
Industry leading face manipulation platform
☆29,400Updated this week
modelscope / FunASR
View on GitHub
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,483Updated this week
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
babysor / MockingBird
View on GitHub
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆36,927Mar 3, 2026Updated 4 months ago
langgenius / dify
View on GitHub
Build Agentic workflows, RAG pipelines, with rich AI model and tool support on one collaborative workspace. Deploy on cloud, VPC, or self…
☆150,308Updated this week
ChatGPTNextWeb / NextChat
View on GitHub
✨ Light and Fast AI Assistant. Support: Web | iOS | MacOS | Android | Linux | Windows
☆88,555Jul 6, 2026Updated 3 weeks ago
jianchang512 / clone-voice
View on GitHub
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具，使用你的音色或任意声音来录制音频
☆8,985Aug 29, 2025Updated 10 months ago
lobehub / lobehub
View on GitHub
🤯 LobeHub is your Chief Agent Operator, organizing your agents into 7×24 operations by hiring, scheduling, and reporting on your entire …
☆80,830Updated this week
jianchang512 / pyvideotrans
View on GitHub
Translate the video from one language to another and embed dubbing & subtitles.
☆18,444Updated this week
suno-ai / bark
View on GitHub
🔊 Text-Prompted Generative Audio Model
☆39,216Aug 19, 2024Updated last year
OpenTalker / SadTalker
View on GitHub
[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
☆13,973Jun 26, 2024Updated 2 years ago
songquanpeng / one-api
View on GitHub
LLM API 管理 & 分发系统，支持 OpenAI、Azure、Anthropic Claude、Google Gemini、DeepSeek、字节豆包、ChatGLM、文心一言、讯飞星火、通义千问、360 智脑、腾讯混元等主流模型，统一 API 适配，可用于 key …
☆35,955Jan 9, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
QwenAudio / SenseVoice
View on GitHub
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,940Updated this week
Huanshere / VideoLingo
View on GitHub
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切…
☆17,860Jul 2, 2026Updated 3 weeks ago
jianchang512 / ChatTTS-ui
View on GitHub
一个简单的本地网页界面，使用ChatTTS将文字合成为语音，同时支持对外提供API接口。A simple native web interface that uses ChatTTS to synthesize text into speech, along with su…
☆7,626Jun 14, 2026Updated last month
Anjok07 / ultimatevocalremovergui
View on GitHub
GUI for a Vocal Remover that uses Deep Neural Networks.
☆25,534Mar 13, 2025Updated last year
myshell-ai / MeloTTS
View on GitHub
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
☆7,551Dec 24, 2024Updated last year
netease-youdao / EmotiVoice
View on GitHub
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
☆8,499Aug 13, 2024Updated last year
labring / FastGPT
View on GitHub
FastGPT is a knowledge-based platform built on the LLMs, offers a comprehensive suite of out-of-the-box capabilities such as data process…
☆29,134Updated this week
jaywalnut310 / vits
View on GitHub
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
☆7,887Dec 6, 2023Updated 2 years ago
rany2 / edge-tts
View on GitHub
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
☆11,587Mar 22, 2026Updated 4 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
zai-org / ChatGLM-6B
View on GitHub
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
☆41,021Jun 27, 2024Updated 2 years ago
hiroi-sora / Umi-OCR
View on GitHub
OCR software, free and offline. 开源、免费的离线OCR软件。支持截屏/批量导入图片，PDF文档识别，排除水印/页眉页脚，扫描/生成二维码。内置多国语言库。
☆46,242Nov 20, 2025Updated 8 months ago
open-mmlab / Amphion
View on GitHub
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…
☆9,967Mar 25, 2026Updated 4 months ago
openai / whisper
View on GitHub
Robust Speech Recognition via Large-Scale Weak Supervision
☆105,641Apr 15, 2026Updated 3 months ago
Plachtaa / VITS-fast-fine-tuning
View on GitHub
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
☆5,019Jan 21, 2025Updated last year
chatchat-space / Langchain-Chatchat
View on GitHub
Langchain-Chatchat（原Langchain-ChatGLM）基于 Langchain 与 ChatGLM, Qwen 与 Llama 等语言模型的 RAG 与 Agent 应用 | Langchain-Chatchat (formerly langchain…
☆38,475Nov 10, 2025Updated 8 months ago
ollama / ollama
View on GitHub
Get up and running with Kimi-K2.6, GLM-5.2, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
☆176,923Updated this week