kdrkdrkdr / JA2ML-VITSLinks

Japanese Dataset to Multi Language TTS (Only for Japanese Dataset)

☆3

Alternatives and similar repositories for JA2ML-VITS

Users that are interested in JA2ML-VITS are comparing it to the libraries listed below

Sorting:

litagin02 / laughter-collector
大量の音声データから笑い声部分を集めるやつ
☆10Updated last year
kdrkdrkdr / JK-VITS
Bilingual-TTS (Japanese and Korean)
☆31Updated 2 years ago
ORI-Muchim / BEGANSing
BEGANSing - Korean SVS + SVC + AudioSR
☆11Updated last year
ORI-Muchim / PolyLangVITS
Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)
☆76Updated last year
reppy4620 / x-vits
☆13Updated this week
oatsu-gh / utau_renderer_with_diff_svc
Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model
☆10Updated 2 years ago
MaxMax2016 / Glow-SVC
4G GPU & 10 Minutes for train
☆12Updated last year
jisang93 / VISinger
Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…
☆15Updated 2 years ago
adelacvg / diff-vits
☆39Updated last year
p0p4k / vits3_pytorch
☆29Updated last year
ORI-Muchim / AudioSR-Upsampling
AudioSR-Upsampling (any -> 48kHz)
☆41Updated last year
kdrkdrkdr / VALL-E-Korean
VALL-E 한국어 버전
☆12Updated last year
Tikai7 / DiTTO-TTS
DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors
☆28Updated 5 months ago
misakiudon / MB-iSTFT-VITS-multilingual
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Multilin…
☆67Updated 2 years ago
zengchang233 / CrossSinger
The source code for the paper CrossSinger (asru2023)
☆18Updated last year
MaxMax2016 / max-vc
singing voice conversion without f0
☆23Updated 2 years ago
Scarfmonster / HiFiPLN
Multispeaker Community Vocoder Model for DiffSinger
☆37Updated last week
AlexandaJerry / SingingVoice-MFA-Training
MFA acoustic model training based on Opencpop
☆15Updated 2 years ago
tenebo / g2pk2
Updated folk of g2pk
☆12Updated last year
innnky / glow-svc
singing voice conversion based on glow-tts
☆11Updated last year
CODEJIN / XiaoiceSing2
☆19Updated 2 years ago
PlayVoice / VI-SVC
VI-SVC model is just VITS without MAS and DurationPredictor.
☆10Updated last year
monglechap / fluenttts
FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS
☆20Updated 2 years ago
hcy71o / SC-CNN
SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems
☆39Updated last year
Jackson-Kang / MFARunner
A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.
☆45Updated 2 years ago
reppy4620 / vocoders
My vocoder experiments
☆30Updated last week
Ereboas / TacoLM
☆19Updated last year
anton-kashkin / hifi_vc
☆25Updated 2 years ago
vtuber-plan / FlowVAE
☆15Updated last year
kdrkdrkdr / RVC-VITS
Few-shot multilingual tts with RVC and Vits
☆51Updated 2 years ago