kdrkdrkdr/JK-VITS

Bilingual-TTS (Japanese and Korean)

☆32

Alternatives and similar repositories for JK-VITS

Users who are interested in JK-VITS are comparing it to the repositories listed below.

reppy4620 / x-vits
☆14 · Aug 1, 2025 · Updated last year
kdrkdrkdr / RVC-VITS
Few-shot multilingual tts with RVC and Vits
☆50 · Jun 15, 2023 · Updated 3 years ago
shengcanxu / canoSpeech
text to speech
☆10 · Mar 19, 2024 · Updated 2 years ago
jisang93 / VISinger
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆20 · May 12, 2023 · Updated 3 years ago
Jackson-Kang / MFARunner
A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.
☆45 · May 25, 2023 · Updated 3 years ago
kdrkdrkdr / VALL-E-Korean
VALL-E Korean version
☆12 · Aug 22, 2023 · Updated 2 years ago
hcy71o / TransferTTS
TransferTTS (Zero-Shot learning of VITS)
☆102 · Sep 23, 2022 · Updated 3 years ago
ORI-Muchim / One-Click-VITS-Training
VITS(Data Preprocessing + Whisper ASR + Text Preprocessing + Modification config.json + Training, Inference)
☆36 · Feb 28, 2024 · Updated 2 years ago
CODEJIN / XiaoiceSing2
☆19 · Feb 2, 2023 · Updated 3 years ago
DDATT / Vits2-onnx-cpp
Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++
☆19 · Apr 17, 2024 · Updated 2 years ago
jwj7140 / Bert-VITS2-Korean
vits2 backbone with multilingual-bert (Korean support)
☆28 · Apr 6, 2024 · Updated 2 years ago
misakiudon / MB-iSTFT-VITS-multilingual
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Multilin…
☆72 · Nov 21, 2022 · Updated 3 years ago
anton-kashkin / hifi_vc
☆25 · Jan 24, 2023 · Updated 3 years ago
ORI-Muchim / PolyLangVITS
Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)
☆75 · Feb 28, 2024 · Updated 2 years ago
reppy4620 / convnext_tts
Unofficial implementation of ConvNeXt-TTS powered by lightning
☆18 · Oct 20, 2024 · Updated last year
p0p4k / Matcha-TTS-2
E2E TTS using Conditional Flow Matching (Experimental*)
☆71 · Nov 10, 2023 · Updated 2 years ago
liuhuang31 / g2pw_once
G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…
☆14 · Dec 30, 2023 · Updated 2 years ago
adelacvg / diff-vits
☆39 · Oct 1, 2023 · Updated 2 years ago
Takaaki-Saeki / zm-text-tts
[IJCAI'23] Learning to Speak from Text for Low-Resource TTS
☆65 · May 30, 2023 · Updated 3 years ago
shivammehta25 / BetterFastSpeech2
Just another FastSpeech 2 but cleaner code :)
☆29 · Jun 28, 2024 · Updated 2 years ago
cnaigithub / Auto_Tuning_Zeroshot_TTS_and_VC
Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…
☆80 · May 29, 2023 · Updated 3 years ago
hcy71o / MB-iSTFT-VITS-with-AutoVocoder
Incorporating AutoVocoder to MB-iSTFT-VITS
☆47 · Dec 1, 2022 · Updated 3 years ago
scutcsq / Neural-Transducers-for-Two-Stage-Text-to-Speech-via-Semantic-Token-Prediction
Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…
☆60 · Apr 4, 2024 · Updated 2 years ago
lars76 / fastspeech2-clean
Clean and modernized implementation of FastSpeech2/LightSpeech using IPA
☆19 · Aug 16, 2024 · Updated last year
p1an-lin-jung / wv_tts
☆19 · Mar 22, 2024 · Updated 2 years ago
zjwang21 / mix-phoneme-bert
An unofficial PyTorch implementation of Mix-Phoneme-Bert
☆40 · Jul 10, 2023 · Updated 3 years ago
jhwanflow / Korean-Voice-Cloning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
☆28 · Jun 7, 2022 · Updated 4 years ago
X-E-Speech / X-E-Speech-code
X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion
☆112 · Apr 1, 2024 · Updated 2 years ago
seastar105 / pflow-encodec
Implementation of TTS model based on NVIDIA P-Flow TTS Paper
☆77 · Jul 13, 2026 · Updated 2 weeks ago
hs-oh-prml / DurFlexEVC
☆82 · Jan 22, 2025 · Updated last year
huutuongtu / Lightvoc
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18 · May 17, 2024 · Updated 2 years ago
hcy71o / SNAC
Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…
☆57 · Aug 7, 2023 · Updated 2 years ago
audiodemo / voice-conversion
Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks
☆17 · Aug 18, 2023 · Updated 2 years ago
esoyeon / KoreanTTS
Korean Text To Speech Project: Using Tacotron1, Tacotron2, Wavenet and Melgan
☆40 · Nov 15, 2024 · Updated last year
declare-lab / HyperTTS
☆40 · Apr 15, 2024 · Updated 2 years ago
AkshathRaghav / tinyspeech
Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"
☆23 · Jun 7, 2025 · Updated last year
tonnetonne814 / SiFi-VITS2-44100-Ja
DDPM-based Pitch Generation and Pitch Controllable Voice Synthesis.
☆55 · Sep 25, 2023 · Updated 2 years ago
mush42 / istft-onnx
Export an ONNX graph that performs ISTFT. Designed for TTS models.
☆28 · Apr 23, 2024 · Updated 2 years ago
XiangLi2022 / CM-TTS
[Findings of NAACL 2024] Source code of paper CM-TTS: Enhancing Real Time Text-to-Speech Synthesis Efficiency through Weighted Samplers a…
☆68 · Mar 31, 2024 · Updated 2 years ago