fishaudio/Bert-VITS2

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/fishaudio/Bert-VITS2)

fishaudio / Bert-VITS2

vits2 backbone with multilingual-bert

☆8,782

Alternatives and similar repositories for Bert-VITS2

Users that are interested in Bert-VITS2 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RVC-Boss / GPT-SoVITS
View on GitHub
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
☆60,205Jul 22, 2026Updated last week
Plachtaa / VITS-fast-fine-tuning
View on GitHub
This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion
☆5,019Jan 21, 2025Updated last year
jiangyuxiaoxiao / Bert-VITS2-UI
View on GitHub
BertVITS2前端界面
☆304Jan 1, 2024Updated 2 years ago
anyvoiceai / MassTTS
View on GitHub
a TTS demo for training new characters.
☆472Jan 5, 2024Updated 2 years ago
svc-develop-team / so-vits-svc
View on GitHub
SoftVC VITS Singing Voice Conversion
☆28,151Nov 11, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,559Updated this week
PlayVoice / vits_chinese
View on GitHub
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
☆1,229Feb 5, 2024Updated 2 years ago
YYuX-1145 / Bert-VITS2-Integration-package
View on GitHub
vits2 backbone with bert
☆333Apr 13, 2024Updated 2 years ago
jaywalnut310 / vits
View on GitHub
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
☆7,889Dec 6, 2023Updated 2 years ago
RVC-Project / Retrieval-based-Voice-Conversion-WebUI
View on GitHub
Easily train a good VC model with voice data <= 10 mins!
☆36,798Jul 23, 2026Updated last week
KevinWang676 / Bark-Voice-Cloning
View on GitHub
Bark Voice Cloning and Voice Cloning for Chinese Speech
☆2,949May 31, 2026Updated last month
innnky / emotional-vits
View on GitHub
无需情感标注的情感可控语音合成模型，基于VITS
☆1,392Mar 30, 2023Updated 3 years ago
p0p4k / vits2_pytorch
View on GitHub
unofficial vits2-TTS implementation in pytorch
☆548Mar 28, 2024Updated 2 years ago
QwenAudio / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,464May 25, 2026Updated 2 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
PlayVoice / whisper-vits-svc
View on GitHub
Core Engine of Singing Voice Conversion & Singing Voice Clone
☆2,863Apr 23, 2024Updated 2 years ago
Artrajz / vits-simple-api
View on GitHub
A simple VITS HTTP API, developed by extending Moegoe with additional features.
☆1,050May 18, 2026Updated 2 months ago
daniilrobnikov / vits2
View on GitHub
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
☆643Sep 11, 2023Updated 2 years ago
AI-Hobbyist / Genshin_Datasets
View on GitHub
Genshin Datasets For SVC/SVS/TTS
☆734Jan 11, 2026Updated 6 months ago
2noise / ChatTTS
View on GitHub
A generative speech model for daily dialogue.
☆39,696Apr 10, 2026Updated 3 months ago
Plachtaa / VALL-E-X
View on GitHub
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
☆7,935Feb 11, 2024Updated 2 years ago
netease-youdao / EmotiVoice
View on GitHub
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
☆8,501Aug 13, 2024Updated last year
yxlllc / DDSP-SVC
View on GitHub
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
☆2,635Updated this week
innnky / ar-vits
View on GitHub
text to speech using autoregressive transformer and VITS
☆248Apr 3, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OpenTalker / SadTalker
View on GitHub
[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
☆13,973Jun 26, 2024Updated 2 years ago
cronrpc / SubFix
View on GitHub
SubFix: Efficient Web-Based Audio Subtitle Editing and Multilingual Automatic Annotation Tool.
☆206Feb 5, 2024Updated 2 years ago
litagin02 / Style-Bert-VITS2
View on GitHub
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles.
☆1,341Dec 7, 2025Updated 7 months ago
modelscope / FunASR
View on GitHub
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,542Updated this week
coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,836Aug 16, 2024Updated last year
Ikaros-521 / AI-Vtuber
View on GitHub
AI Vtuber是一个由【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】驱动的虚拟主播【Live2D/UE/xuniren】，可以在【Bilibili/抖音/…
☆4,410Jul 29, 2025Updated last year
OpenTalker / video-retalking
View on GitHub
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
☆7,269Aug 5, 2024Updated last year
QwenAudio / SenseVoice
View on GitHub
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,954Updated this week
Anjok07 / ultimatevocalremovergui
View on GitHub
GUI for a Vocal Remover that uses Deep Neural Networks.
☆25,562Mar 13, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
voicepaw / so-vits-svc-fork
View on GitHub
so-vits-svc fork with realtime support, improved interface and more features.
☆9,328Updated this week
modelscope / KAN-TTS
View on GitHub
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…
☆525Dec 28, 2023Updated 2 years ago
babysor / MockingBird
View on GitHub
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆36,913Mar 3, 2026Updated 4 months ago
suno-ai / bark
View on GitHub
🔊 Text-Prompted Generative Audio Model
☆39,215Aug 19, 2024Updated last year
PriesiaMioShirakana / DragonianVoice
View on GitHub
多个SVC/TTS的C++推理库
☆1,128May 18, 2025Updated last year
CjangCjengh / MoeGoe
View on GitHub
Executable file for VITS inference
☆2,423Aug 22, 2023Updated 2 years ago
open-mmlab / Amphion
View on GitHub
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junio…
☆9,972Mar 25, 2026Updated 4 months ago