Plachtaa/VITS-fast-fine-tuning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Plachtaa/VITS-fast-fine-tuning)

Plachtaa / VITS-fast-fine-tuning

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

☆5,019

Alternatives and similar repositories for VITS-fast-fine-tuning

Users that are interested in VITS-fast-fine-tuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fishaudio / Bert-VITS2
View on GitHub
vits2 backbone with multilingual-bert
☆8,773Updated this week
jaywalnut310 / vits
View on GitHub
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
☆7,882Dec 6, 2023Updated 2 years ago
PlayVoice / vits_chinese
View on GitHub
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
☆1,231Feb 5, 2024Updated 2 years ago
svc-develop-team / so-vits-svc
View on GitHub
SoftVC VITS Singing Voice Conversion
☆28,146Nov 11, 2023Updated 2 years ago
CjangCjengh / MoeGoe
View on GitHub
Executable file for VITS inference
☆2,422Aug 22, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
innnky / emotional-vits
View on GitHub
无需情感标注的情感可控语音合成模型，基于VITS
☆1,393Mar 30, 2023Updated 3 years ago
SayaSS / vits-finetuning
View on GitHub
Fine-Tuning your VITS model using a pre-trained model
☆545May 2, 2023Updated 3 years ago
CjangCjengh / vits
View on GitHub
VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai
☆939Dec 6, 2023Updated 2 years ago
PlayVoice / whisper-vits-svc
View on GitHub
Core Engine of Singing Voice Conversion & Singing Voice Clone
☆2,861Apr 23, 2024Updated 2 years ago
RVC-Boss / GPT-SoVITS
View on GitHub
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
☆59,806Updated this week
Jack-Cherish / dsi
View on GitHub
Do Something Interesting缩写，做一些有趣的事
☆254Jan 10, 2025Updated last year
RVC-Project / Retrieval-based-Voice-Conversion-WebUI
View on GitHub
Easily train a good VC model with voice data <= 10 mins!
☆36,438Nov 24, 2024Updated last year
yxlllc / DDSP-SVC
View on GitHub
Real-time end-to-end singing voice conversion system based on DDSP (Differentiable Digital Signal Processing)
☆2,625Feb 22, 2026Updated 4 months ago
Plachtaa / VALL-E-X
View on GitHub
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/
☆7,937Feb 11, 2024Updated 2 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
KevinWang676 / Bark-Voice-Cloning
View on GitHub
Bark Voice Cloning and Voice Cloning for Chinese Speech
☆2,948May 31, 2026Updated last month
voicepaw / so-vits-svc-fork
View on GitHub
so-vits-svc fork with realtime support, improved interface and more features.
☆9,327Updated this week
OpenTalker / SadTalker
View on GitHub
[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
☆13,952Jun 26, 2024Updated 2 years ago
Artrajz / vits-simple-api
View on GitHub
A simple VITS HTTP API, developed by extending Moegoe with additional features.
☆1,048May 18, 2026Updated 2 months ago
PriesiaMioShirakana / DragonianVoice
View on GitHub
多个SVC/TTS的C++推理库
☆1,126May 18, 2025Updated last year
p0p4k / vits2_pytorch
View on GitHub
unofficial vits2-TTS implementation in pytorch
☆549Mar 28, 2024Updated 2 years ago
babysor / MockingBird
View on GitHub
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆36,922Mar 3, 2026Updated 4 months ago
modelscope / KAN-TTS
View on GitHub
KAN-TTS is a speech-synthesis training framework, please try the demos we have posted at https://modelscope.cn/models?page=1&tasks=text-…
☆526Dec 28, 2023Updated 2 years ago
suno-ai / bark
View on GitHub
🔊 Text-Prompted Generative Audio Model
☆39,197Aug 19, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
wenet-e2e / wetts
View on GitHub
Production First and Production Ready End-to-End Text-to-Speech Toolkit
☆416Nov 20, 2025Updated 7 months ago
YYuX-1145 / Bert-VITS2-Integration-package
View on GitHub
vits2 backbone with bert
☆333Apr 13, 2024Updated 2 years ago
fishaudio / fish-speech
View on GitHub
SOTA Open Source TTS
☆31,273Jun 9, 2026Updated last month
OpenTalker / video-retalking
View on GitHub
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
☆7,265Aug 5, 2024Updated last year
Executedone / Chinese-FastSpeech2
View on GitHub
基于标贝数据继续训练，同时对原本的FastSpeech2模型做了改进，引入了韵律表征以及韵律预测模块，使中文发音更生动且富有节奏
☆278Sep 10, 2023Updated 2 years ago
lifeiteng / vall-e
View on GitHub
PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html
☆2,205Sep 10, 2025Updated 10 months ago
openvpi / DiffSinger
View on GitHub
An advanced singing voice synthesis system with high fidelity, expressiveness, controllability and flexibility based on DiffSinger: Singi…
☆3,172Jun 28, 2026Updated 2 weeks ago
Rudrabha / Wav2Lip
View on GitHub
This repository contains the codes of "A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild", published at ACM Mult…
☆13,096Jun 22, 2025Updated last year
Zz-ww / SadTalker-Video-Lip-Sync
View on GitHub
本项目基于SadTalkers实现视频唇形合成的Wav2lip。通过以视频文件方式进行语音驱动生成唇形，设置面部区域可配置的增强方式进行合成唇形（人脸）区域画面增强，提高生成唇形的清晰度。使用DAIN 插帧的DL算法对生成视频进行补帧，补充帧间合成唇形的动作过渡，使合成的唇…
☆2,005Jun 4, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
flutydeer / audio-slicer
View on GitHub
A simple GUI application that slices audio with silence detection
☆1,456Apr 5, 2026Updated 3 months ago
zai-org / ChatGLM-6B
View on GitHub
ChatGLM-6B: An Open Bilingual Dialogue Language Model | 开源双语对话语言模型
☆41,024Jun 27, 2024Updated 2 years ago
coqui-ai / TTS
View on GitHub
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,756Aug 16, 2024Updated last year
FunAudioLLM / CosyVoice
View on GitHub
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,188May 25, 2026Updated last month
CjangCjengh / TTSModels
View on GitHub
☆622Nov 27, 2022Updated 3 years ago
w4123 / GenshinVoice
View on GitHub
Voice dataset of Genshin Impact 原神语音数据集
☆722Jul 5, 2023Updated 3 years ago
Ikaros-521 / AI-Vtuber
View on GitHub
AI Vtuber是一个由【ChatterBot/ChatGPT/claude/langchain/chatglm/text-gen-webui/闻达/千问/kimi/ollama】驱动的虚拟主播【Live2D/UE/xuniren】，可以在【Bilibili/抖音/…
☆4,399Jul 29, 2025Updated 11 months ago