sophiefy/VITS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sophiefy/VITS)

sophiefy / VITS

ACG Text-to-Speech

☆177

Alternatives and similar repositories for VITS

Users that are interested in VITS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

sophiefy / VITS-Bilingual
View on GitHub
Chinese-Japanese Bilingual Text-to-Speech
☆32Aug 30, 2022Updated 3 years ago
luoyily / MoeTTS
View on GitHub
Speech synthesis model /inference GUI repo for galgame characters based on Tacotron2, Hifigan, VITS and Diff-svc
☆990Mar 3, 2023Updated 3 years ago
sophiefy / Sovits
View on GitHub
An unofficial implementation of the combination of Soft-VC and VITS
☆456Nov 13, 2022Updated 3 years ago
CjangCjengh / MoeGoe
View on GitHub
Executable file for VITS inference
☆2,423Aug 22, 2023Updated 2 years ago
AlexandaJerry / whisper-vits-japanese
View on GitHub
Vits Japanese with Whisper as data processor (you can train your VITS even you only have audios)
☆162May 7, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
innnky / emotional-vits
View on GitHub
无需情感标注的情感可控语音合成模型，基于VITS
☆1,392Mar 30, 2023Updated 3 years ago
CjangCjengh / TTSModels
View on GitHub
☆622Nov 27, 2022Updated 3 years ago
Kanade-nya / PJSK-MultiGUI
View on GitHub
PJSK-Vits GUI
☆112Jul 24, 2025Updated 11 months ago
bluenekozkm / moe-tts-webui
View on GitHub
The better web ui for MOE-TTS
☆24Nov 11, 2023Updated 2 years ago
CjangCjengh / vits
View on GitHub
VITS implementation of Japanese, Chinese, Korean, Sanskrit and Thai
☆939Dec 6, 2023Updated 2 years ago
MasayaKawamura / MB-iSTFT-VITS
View on GitHub
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
☆469Nov 17, 2022Updated 3 years ago
SayaSS / vits-finetuning
View on GitHub
Fine-Tuning your VITS model using a pre-trained model
☆544May 2, 2023Updated 3 years ago
CjangCjengh / MoeGoe_GUI
View on GitHub
GUI for MoeGoe
☆572Aug 22, 2023Updated 2 years ago
AlexandaJerry / vits-mandarin-biaobei
View on GitHub
application of vits on mandarin tts
☆121May 11, 2023Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
PlayVoice / lora-svc
View on GitHub
singing voice change based on whisper, and lora for singing voice clone
☆648Nov 3, 2023Updated 2 years ago
anton-kashkin / hifi_vc
View on GitHub
☆25Jan 24, 2023Updated 3 years ago
PriesiaMioShirakana / DragonianVoice
View on GitHub
多个SVC/TTS的C++推理库
☆1,126May 18, 2025Updated last year
bshall / soft-vc
View on GitHub
Soft speech units for voice conversion
☆455Mar 14, 2024Updated 2 years ago
A-kirami / nonebot-plugin-logpile
View on GitHub
☆10Feb 6, 2025Updated last year
CjangCjengh / tacotron2-japanese
View on GitHub
Tacotron2 implementation of Japanese
☆267Sep 4, 2022Updated 3 years ago
Takenoko3333 / remove-meta-alpha
View on GitHub
This tool allows you to process multiple images simultaneously, including removing metadata and alpha channels from the images. / 本ツールは、複…
☆10Dec 20, 2023Updated 2 years ago
Takaaki-Saeki / zm-text-tts
View on GitHub
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆65May 30, 2023Updated 3 years ago
fumiama / MoeGoe
View on GitHub
MoeGoe Azure Cloud Function API
☆52Aug 27, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
jaywalnut310 / vits
View on GitHub
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
☆7,885Dec 6, 2023Updated 2 years ago
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
kdrkdrkdr / ko2kana
View on GitHub
Convert Korean to Katakana
☆13Dec 13, 2023Updated 2 years ago
bshall / hubert
View on GitHub
HuBERT content encoders for: A Comparison of Discrete and Soft Speech Units for Improved Voice Conversion
☆405Oct 1, 2024Updated last year
PlayVoice / VI-Speaker
View on GitHub
Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.
☆30Sep 16, 2022Updated 3 years ago
anonymous-pits / pits
View on GitHub
PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor
☆280Jul 16, 2023Updated 3 years ago
hcy71o / MB-iSTFT-VITS-with-AutoVocoder
View on GitHub
Incorporating AutoVocoder to MB-iSTFT-VITS
☆47Dec 1, 2022Updated 3 years ago
yl4579 / StyleTTS
View on GitHub
Official Implementation of StyleTTS
☆466Jan 13, 2025Updated last year
PlayVoice / VI-SVS
View on GitHub
Singing Voice Synthesis based on VITS, different from VISinger
☆198Nov 13, 2023Updated 2 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
JusperLee / Gull-Codec-Training
View on GitHub
☆12Mar 11, 2025Updated last year
innnky / ar-vits
View on GitHub
text to speech using autoregressive transformer and VITS
☆248Apr 3, 2024Updated 2 years ago
MaxMax2016 / Glow-SVC
View on GitHub
4G GPU & 10 Minutes for train
☆12Aug 9, 2023Updated 2 years ago
babe269 / performant
View on GitHub
A toolset for easy formant extraction and visualization from wav files and TTS models
☆33Sep 2, 2022Updated 3 years ago
AdjointOperator / Augmented-DDTagger
View on GitHub
Multi-backend (WD taggers, deepdanbooru) fast automatic tagging utility
☆26Feb 3, 2023Updated 3 years ago
SonderXiaoming / portune
View on GitHub
☆10Dec 1, 2022Updated 3 years ago
yzGuu830 / efficient-speech-codec
View on GitHub
[EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers
☆126Mar 20, 2025Updated last year