vtuber-plan/vcvits

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/vtuber-plan/vcvits)

vtuber-plan / vcvits

Non Parallel Voice Conversion based on VITS

☆24

Alternatives and similar repositories for vcvits

Users that are interested in vcvits are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

reppy4620 / convnext_tts
View on GitHub
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18Oct 20, 2024Updated last year
Respaired / RiFornet_Vocoder
View on GitHub
a Neural Vocoder supporting Ring Attention, Conformer and NSF.
☆25Aug 1, 2025Updated 11 months ago
reppy4620 / x-vits
View on GitHub
☆14Aug 1, 2025Updated 11 months ago
mushanshanshan / ESLTTS
View on GitHub
ESLTTS dataset
☆16Feb 6, 2025Updated last year
BakerBunker / FreeV
View on GitHub
[InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter
☆98Jul 4, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
shengcanxu / canoSpeech
View on GitHub
text to speech
☆10Mar 19, 2024Updated 2 years ago
vtuber-plan / hifi-gan
View on GitHub
An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.
☆32Apr 10, 2023Updated 3 years ago
hcy71o / SNAC
View on GitHub
Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…
☆57Aug 7, 2023Updated 2 years ago
ryanrudes / YTTTS
View on GitHub
The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions
☆53Apr 1, 2021Updated 5 years ago
exercise-book-yq / FreeCodec
View on GitHub
FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS
☆24Sep 9, 2024Updated last year
tonnetonne814 / unofficial-vits2-44100-Ja
View on GitHub
44100Hz日本語音源に対応させた unofficial vits2-TTS implementation in pytorchです。
☆24Sep 1, 2023Updated 2 years ago
hcy71o / AutoVocoder
View on GitHub
Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing
☆71Dec 2, 2022Updated 3 years ago
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
choiHkk / Transformer-TTS-V2
View on GitHub
☆25Mar 6, 2024Updated 2 years ago
cnaigithub / Auto_Tuning_Zeroshot_TTS_and_VC
View on GitHub
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…
☆80May 29, 2023Updated 3 years ago
innnky / MagVITS
View on GitHub
VITS with phoneme-level prosody modeling based on MaskGIT
☆85Aug 31, 2024Updated last year
adelacvg / diff-vits
View on GitHub
☆39Oct 1, 2023Updated 2 years ago
QuyAnh2005 / vits-japanese
View on GitHub
Text to Speech for Japanese
☆16May 11, 2023Updated 3 years ago
misakiudon / MB-iSTFT-VITS-multilingual
View on GitHub
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Multilin…
☆72Nov 21, 2022Updated 3 years ago
adelacvg / DPTTS
View on GitHub
An AR+AR TTS attempt.
☆18Jan 13, 2025Updated last year
shivammehta25 / BetterFastSpeech2
View on GitHub
Just another FastSpeech 2 but cleaner code :)
☆29Jun 28, 2024Updated 2 years ago
anton-kashkin / hifi_vc
View on GitHub
☆25Jan 24, 2023Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
hs-oh-prml / DurFlexEVC
View on GitHub
☆82Jan 22, 2025Updated last year
fluxions-ai / stftvae
View on GitHub
Inference for the STFT-VAE continuous audio codec (24kHz, 3.125Hz latent)
☆43Jul 12, 2026Updated last week
lucadellalib / discrete-wavlm-codec
View on GitHub
A neural speech codec based on discrete WavLM representations
☆26Aug 28, 2024Updated last year
yuan1615 / AdaVocoder
View on GitHub
Adaptive Vocoder for Custom Voice
☆61Sep 22, 2022Updated 3 years ago
tabahi / contexless-phonemes-CUPE
View on GitHub
pytorch model for contexless-phoneme prediction from speech audio
☆32Oct 30, 2025Updated 8 months ago
thuhcsi / SnakeGAN
View on GitHub
Please visit https://thuhcsi.github.io/SnakeGAN/
☆37Apr 25, 2023Updated 3 years ago
harisreedhar / Portrait-Talker
View on GitHub
Talking head animation
☆26Dec 8, 2023Updated 2 years ago
MaxMax2016 / Grad-TTS-Chinese
View on GitHub
Huawei Grad-TTS for Chinese
☆50Sep 26, 2023Updated 2 years ago
p0p4k / vits3_pytorch
View on GitHub
☆28Nov 15, 2023Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
liuhuang31 / Megatts2_HierSpeechpp
View on GitHub
Megatts2 use HierSpeechpp's vocoder
☆18Dec 2, 2024Updated last year
adelacvg / NS2VC
View on GitHub
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech
☆237Feb 29, 2024Updated 2 years ago
fishaudio / vocoder
View on GitHub
☆130Jul 6, 2026Updated 2 weeks ago
liuhuang31 / g2pw_once
View on GitHub
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14Dec 30, 2023Updated 2 years ago
LAION-AI / emotional-speech-annotations
View on GitHub
This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models
☆35Oct 13, 2024Updated last year
MaxMax2016 / max-vc
View on GitHub
singing voice conversion without f0
☆23May 10, 2023Updated 3 years ago
winddori2002 / TriAAN-VC
View on GitHub
TriAAN-VC: Triple Adaptive Attention Normalization for Any-to-Any Voice Conversion
☆146Jan 15, 2024Updated 2 years ago