nivibilla/efficient-vits-finetuning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nivibilla/efficient-vits-finetuning)

nivibilla / efficient-vits-finetuning

Finetuning VITS Efficiently

☆32

Alternatives and similar repositories for efficient-vits-finetuning

Users that are interested in efficient-vits-finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hcy71o / MB-iSTFT-VITS-with-AutoVocoder
View on GitHub
Incorporating AutoVocoder to MB-iSTFT-VITS
☆47Dec 1, 2022Updated 3 years ago
tts-tutorial / ijcai2021
View on GitHub
☆13Apr 4, 2023Updated 3 years ago
yihuitang / StyleTTS_Mandarin
View on GitHub
Implementation of StyleTTS for Mandarin
☆11Jun 22, 2023Updated 3 years ago
adelacvg / detail_tts
View on GitHub
All generative model in one for better TTS model
☆74Sep 8, 2024Updated last year
hcy71o / SC-VITS
View on GitHub
VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.
☆36Sep 21, 2022Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
rishikksh20 / MiniMax-TTS-pytorch
View on GitHub
Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report
☆47Sep 2, 2025Updated 10 months ago
BridgetteSong / Tacotron2
View on GitHub
☆13Sep 21, 2022Updated 3 years ago
heatz123 / naturalspeech
View on GitHub
A fully working pytorch implementation of NaturalSpeech (Tan et al., 2022)
☆475Feb 7, 2024Updated 2 years ago
lifeiteng / NaturalSpeech2
View on GitHub
☆33Jun 29, 2023Updated 3 years ago
hs-oh-prml / DiffProsody
View on GitHub
☆69Jul 29, 2023Updated 3 years ago
shang0712 / HierTTS
View on GitHub
☆47Apr 16, 2023Updated 3 years ago
FENRlR / MB-iSTFT-VITS2
View on GitHub
Application of MB-iSTFT-VITS components to vits2_pytorch
☆135Dec 29, 2025Updated 7 months ago
huangruizhe / audio
View on GitHub
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆10Sep 30, 2024Updated last year
xingchensong / CosyVoice-ttsfrd
View on GitHub
☆25Jun 19, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
p0p4k / vits2_pytorch
View on GitHub
unofficial vits2-TTS implementation in pytorch
☆548Mar 28, 2024Updated 2 years ago
jdh-algo / JoyTTS
View on GitHub
☆41Jul 15, 2025Updated last year
pengzhendong / ngram-punctuator
View on GitHub
An N-gram punctuator for Chinese and English.
☆18Oct 14, 2025Updated 9 months ago
yxlu-0102 / IDEA-TTS
View on GitHub
Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis
☆27Mar 21, 2025Updated last year
pb-brainiac / semseg_od
View on GitHub
☆16Oct 3, 2023Updated 2 years ago
theodorblackbird / lina-speech
View on GitHub
Official implementation of the TTS model Lina-Speech
☆178Jan 9, 2025Updated last year
OlaWod / PitchVC
View on GitHub
PitchVC: Pitch Conditioned Any-to-Many Voice Conversion
☆35Jun 6, 2024Updated 2 years ago
Mddct / cosyvoice2-flow-optimized
View on GitHub
faster inference
☆27Jan 20, 2025Updated last year
e-c-k-e-r / vall-e
View on GitHub
An unofficial PyTorch implementation of VALL-E
☆88Aug 3, 2025Updated 11 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
teo-sl / Audio-Super-Resolution-ViT
View on GitHub
This repository contains the source code for the implementation of two deep learning models concerning the audio super resolution task.
☆14Mar 14, 2023Updated 3 years ago
keonlee9420 / StyleSpeech
View on GitHub
PyTorch Implementation of Meta-StyleSpeech : Multi-Speaker Adaptive Text-to-Speech Generation
☆197Feb 10, 2022Updated 4 years ago
Edresson / GE2E-Speaker-Encoder
View on GitHub
GE2E Speaker Encoder - Generalized End-To-End Loss for Speaker Verification
☆14May 17, 2020Updated 6 years ago
b04901014 / UUVC
View on GitHub
Official implementation for the paper: A Unified One-Shot Prosody and Speaker Conversion System with Self-Supervised Discrete Speech Unit…
☆83Jan 7, 2023Updated 3 years ago
adelacvg / NS2VC
View on GitHub
Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech
☆237Feb 29, 2024Updated 2 years ago
innnky / vispeech
View on GitHub
基于vits fastspeech2 visinger的tts模型
☆24Mar 9, 2023Updated 3 years ago
KathyReid / opensource-voice-tools
View on GitHub
A repo listing known open source voice tools, ordered by where they sit in the voice stack
☆28Sep 23, 2022Updated 3 years ago
duerig / StyleTTS2
View on GitHub
StyleTTS 2 Optimized Training Fork
☆32Feb 2, 2025Updated last year
chorusai / arpa2ipa
View on GitHub
A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet)
☆17Jan 2, 2018Updated 8 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
adelacvg / ttts
View on GitHub
Train the next generation of TTS systems.
☆169Sep 13, 2024Updated last year
gmltmd789 / UnitSpeech
View on GitHub
An official implementation of "UnitSpeech: Speaker-adaptive Speech Synthesis with Untranscribed Data"
☆137Aug 17, 2023Updated 2 years ago
Fatfish588 / Dataset_Generator_For_VITS
View on GitHub
基于达摩院视频切割技术的视频转换为短音频的vits数据集生成工具 A VITS Dataset Generation Tool for Converting Video to Short Audio Based on Damo Academy Video Cutting T…
☆54Jan 17, 2024Updated 2 years ago
tabahi / contexless-phonemes-CUPE
View on GitHub
pytorch model for contexless-phoneme prediction from speech audio
☆32Oct 30, 2025Updated 8 months ago
yellowtownhz / STIGCN
View on GitHub
☆13Feb 3, 2021Updated 5 years ago
MasayaKawamura / MB-iSTFT-VITS
View on GitHub
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform
☆469Nov 17, 2022Updated 3 years ago
shivammehta25 / BetterFastSpeech2
View on GitHub
Just another FastSpeech 2 but cleaner code :)
☆29Jun 28, 2024Updated 2 years ago