ORI-Muchim / MB-iSTFT-VITS-Korean
Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Korean Cleaners
☆14 Updated last year
Alternatives and similar repositories for MB-iSTFT-VITS-Korean:
Users who are interested in MB-iSTFT-VITS-Korean are comparing it to the repositories listed below
- vits2 backbone with multilingual-bert (Korean support) ☆26 Updated last year
- Bilingual-TTS (Japanese and Korean) ☆30 Updated last year
- Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Time Fourier Transform with Multilin… ☆67 Updated 2 years ago
- A simple tool for easily using Montreal Forced Aligner. Also provides alignments (TextGrid) retrieved from ESD. ☆44 Updated last year
- ☆16 Updated last year
- Korean ASR Corpus generated from TEDx talks ☆27 Updated 6 years ago
- Japanese Dataset to Multi Language TTS (Only for Japanese Dataset) ☆3 Updated last year
- VITS (Data Preprocessing + Whisper ASR + Text Preprocessing + Modification config.json + Training, Inference) ☆38 Updated last year
- Use FastSpeech2 and HiFi-GAN to easily perform end-to-end Korean speech synthesis. ☆29 Updated last year
- Transformer-based Korean speech recognition ☆30 Updated 4 years ago
- ☆11 Updated 5 months ago
- Adds speech recognition and a PPT assistant feature using wav2vec-based STT ☆9 Updated 2 years ago
- A dash app that transcribes 한글 into [hɑŋɡɯl]. ☆29 Updated 2 months ago
- ☆13 Updated 5 months ago
- ☆13 Updated 7 months ago
- Few-shot multilingual tts with RVC and Vits ☆50 Updated last year
- End-to-end speech synthesis system with FastSpeech2 & HiFi-GAN ☆13 Updated 2 years ago
- Uses the OpenVoice v2 module for real-time TTS (text-to-speech) in on-device robotics. Trying to run inference on the model on single board li… ☆12 Updated 6 months ago
- ☆24 Updated 7 months ago
- ☆16 Updated 6 months ago
- Korean language support for NNSVS/ENUNU ☆27 Updated last year
- Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer ☆32 Updated 2 months ago
- Multi-speaker Speech Synthesis Using VITS (KO, JA, EN, ZH) ☆74 Updated last year
- Korean version of VALL-E ☆12 Updated last year
- Korean read speech corpus (about 120 hours, 17GB) from National Institute of Korean Language ☆43 Updated 7 years ago
- Updated fork of g2pk ☆11 Updated last year
- Render a WAV and convert it with a [Diff-SVC](https://github.com/prophesier/diff-svc) model ☆10 Updated 2 years ago
- 'Grad-TTS' with Multilingual Cleaners ☆10 Updated last year
- MB-iSTFT-VITS2 (Data Preprocessing + Whisper + Text Preprocessing + Making config.json + Training, Inference) ONE-CLICK ☆12 Updated last year
- ☆29 Updated last year