kdrkdrkdr / JA2ML-VITS
☆13Updated this week
Related projects:
- Bilingual-TTS (Japanese and Korean)☆26Updated last year
- Render wav and convert it with [Diff-SVC](https://github.com/prophesier/diff-svc) model☆10Updated last year
- BEGANSing - Korean SVS + SVC + AudioSR☆12Updated 7 months ago
- 'Grad-TTS' with Multilingual Cleaners☆10Updated 5 months ago
- Unofficial pytorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC…☆15Updated last year
- Multi-speaker Speech Synthesis Using VITS(KO, JA, EN, ZH)☆74Updated 6 months ago
- ☆27Updated 10 months ago
- Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer☆28Updated 3 weeks ago
- Convert Korean to Katakana☆11Updated 9 months ago
- AudioSR-Upsampling (any -> 48kHz)☆38Updated 7 months ago
- singing voice conversion without f0☆22Updated last year
- ☆10Updated last month
- 4G GPU & 10 minutes for training☆12Updated last year
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆11Updated 5 months ago
- vits2 backbone with multilingual-bert (Korean support)☆24Updated 5 months ago
- ☆21Updated 2 weeks ago
- ☆10Updated last year
- End-to-end speech synthesis system with fastspeech2 & hifigan☆13Updated 2 years ago
- My vocoder experiments☆20Updated last month
- ☆11Updated last year
- ☆13Updated 9 months ago
- VITS(Data Preprocessing + Whisper ASR + Text Preprocessing + Modification config.json + Training, Inference)☆35Updated 6 months ago
- An unofficial vits2-TTS implementation in pytorch, adapted to support 44100 Hz Japanese audio sources.☆22Updated last year
- ☆14Updated 4 months ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆42Updated 5 months ago
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Multilin…☆62Updated last year
- ☆19Updated last year
- A simple tool to easily use Montreal Forced Aligner. Also provides alignments (TextGrid) retrieved from ESD.☆42Updated last year
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆21Updated last year
- Aligner for text-to-speech☆15Updated 2 months ago