PlayVoice/VI-SVC

The VI-SVC model is VITS without monotonic alignment search (MAS) and the DurationPredictor.

☆10

Alternatives and similar repositories for VI-SVC

Users that are interested in VI-SVC are comparing it to the libraries listed below.

MaxMax2016 / Glow-SVC
4G GPU and 10 minutes for training
☆12 · Aug 9, 2023 · Updated 2 years ago
uthree / fastersvc
☆26 · Mar 20, 2024 · Updated 2 years ago
innnky / glow-svc
singing voice conversion based on glow-tts
☆12 · Aug 20, 2023 · Updated 2 years ago
KdaiP / conformer-RoPE
Conformer block with Rotary Position Embedding, modified from lucidrains' implementation
☆19 · Sep 13, 2024 · Updated last year
Adibian / ResGrad
Unofficial implementation of ResGrad: Residual Denoising Diffusion Probabilistic Models for Text to Speech
☆20 · Feb 9, 2025 · Updated last year
PlayVoice / BigVGAN
BigVGAN with Neural Source-Filter
☆58 · Sep 21, 2023 · Updated 2 years ago
CODEJIN / XiaoiceSing2
☆19 · Feb 2, 2023 · Updated 3 years ago
zengchang233 / CrossSinger
The source code for the paper CrossSinger (ASRU 2023)
☆18 · Oct 12, 2023 · Updated 2 years ago
lakahaga / dc-comix-tts
Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer
☆74 · Aug 21, 2023 · Updated 2 years ago
hcy71o / SC-CNN
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
☆39 · Nov 1, 2023 · Updated 2 years ago
mush42 / istft-onnx
Export an ONNX graph that performs ISTFT. Designed for TTS models.
☆28 · Apr 23, 2024 · Updated 2 years ago
WelkinYang / WaveODE
An ODE-based generative neural vocoder using Rectified Flow
☆58 · Apr 29, 2023 · Updated 3 years ago
tonnetonne814 / SiFi-VITS2-44100-Ja
DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.
☆55 · Sep 25, 2023 · Updated 2 years ago
sp-uhh / 2sderev
Two-stage Dereverberation Algorithm using DNN-supported multi-channel linear filtering and single-channel non-linear post-filtering
☆15 · Jan 10, 2025 · Updated last year
drauger-os-development / gcde
GTK+ Console Desktop Environment, a desktop environment to give Linux a game-console look and feel.
☆12 · Jan 15, 2021 · Updated 5 years ago
adelacvg / diff-vits
☆39 · Oct 1, 2023 · Updated 2 years ago
reppy4620 / x-vits
☆14 · Aug 1, 2025 · Updated 11 months ago
declare-lab / HyperTTS
☆40 · Apr 15, 2024 · Updated 2 years ago
vtuber-plan / FlowVAE
☆17 · Dec 12, 2023 · Updated 2 years ago
cpdu / vallt
☆36 · Mar 14, 2025 · Updated last year
MiscellaneousStuff / PhoneLM
(R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.
☆48 · Sep 4, 2023 · Updated 2 years ago
joansantoso / iox-adk
☆18 · Jul 11, 2025 · Updated last year
yxlu-0102 / IDEA-TTS
Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis
☆27 · Mar 21, 2025 · Updated last year
neverix / plai-py
Play Minecraft with AI
☆11 · Jul 20, 2022 · Updated 4 years ago
0913ktg / SC_VALL-E
Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E
☆136 · Oct 23, 2024 · Updated last year
daniel5151 / osxhidtouch
User-space HID multitouch touchscreen driver for Mac OS X (Adapted for XPS 15 9560 from kyewei/osxhidtouch)
☆19 · Sep 2, 2017 · Updated 8 years ago
Ereboas / TacoLM
☆19 · May 2, 2024 · Updated 2 years ago
sungnyun / ARMHuBERT
(Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT
☆41 · Aug 29, 2024 · Updated last year
ZehuaKcrissLi / GTR-Voice
☆16 · Nov 11, 2024 · Updated last year
p0p4k / pflowtts_pytorch
Unofficial implementation of NVIDIA P-Flow TTS paper
☆228 · Dec 24, 2024 · Updated last year
breizhn / tPLCnet
This repository contains the trained models and some audio samples for the tPLCnet.
☆29 · Sep 26, 2023 · Updated 2 years ago
zjlww / papers
Connected Papers knockoff, managing academic papers and citations with graph database.
☆12 · Dec 26, 2023 · Updated 2 years ago
huutuongtu / Lightvoc
LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform
☆18 · May 17, 2024 · Updated 2 years ago
choiHkk / nix-tts
End-to-end speech synthesis system with knowledge distillation
☆18 · Jul 16, 2022 · Updated 4 years ago
Zhongxu-Wang / ArtSpeech
ArtSpeech: Adaptive Text-to-Speech Synthesis with Articulatory Representations
☆22 · Sep 21, 2025 · Updated 10 months ago
papercup-open-source / subscale-wavernn
Implementation of the subscale framework from the WaveRNN paper, building on top of Fatchord's WaveRNN repo
☆19 · Oct 8, 2020 · Updated 5 years ago
PlayVoice / Grad-SVC
Diffusion Singing Voice Conversion based on Grad-TTS from Huawei
☆170 · Oct 24, 2023 · Updated 2 years ago
zengchang233 / xiaoicesing2
The source code for the paper XiaoiceSing2 (Interspeech 2023)
☆49 · Jan 15, 2024 · Updated 2 years ago
zy-du / Disentanglement-of-Emotional-Style-and-Speaker-Identity-for-Expressive-Voice-Conversion
This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…
☆21 · Sep 18, 2023 · Updated 2 years ago