hwRG / FastSpeech2-Pytorch-Korean-Multi-Speaker
Multi-Speaker FastSpeech2 applicable to Korean. Description about train and synthesize in detail.
☆8Updated 2 years ago
Related projects: ⓘ
- ☆11Updated last year
- Official Demo Page for DiTTo-TTS: Efficient and Scalable Zero-Shot Text-to-Speech with Diffusion Transformer☆28Updated 3 weeks ago
- Multi-speaker & Multi-style TTS☆28Updated 2 months ago
- A simple tool to easily use Montreal Forced Aligner. Also provide alignment(TextGrid) retrieved from ESD.☆42Updated last year
- The Introduction of the OLKAVS Dataset☆30Updated 3 months ago
- ☆99Updated this week
- Official implementation of EdiTTS: Score-based Editing for Controllable Text-to-Speech (INTERSPEECH 2022)☆117Updated last year
- An unofficial implementation of Vector Quantization Voice Conversion (VQVC).☆29Updated 3 years ago
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆44Updated last month
- This is Pytorch Implementation of Google's Non-attentive Tacotron.☆57Updated last year
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS☆21Updated last year
- Korean Singing Voice Synthesis based on Auto-regressive Boundary Equilibrium GAN☆68Updated 3 years ago
- ☆28Updated last year
- Simple tool for speech dataset augmentation for modeling various prosodies.☆14Updated 3 years ago
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 2 years ago
- ICASSP 2023 Accepted☆189Updated 4 months ago
- ☆60Updated last year
- Torch implementation of NANSY, Neural Analysis and Synthesis, arXiv:2110.14513☆63Updated last year
- A pakage for crawling audio from Youtube☆41Updated last year
- Bilingual-TTS (Japanese and Korean)☆26Updated last year
- ☆45Updated 7 months ago
- ☆25Updated last month
- ☆31Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆64Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆70Updated last year
- ☆20Updated 4 months ago
- 발화자 지정 모듈☆18Updated 9 months ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆14Updated last year
- Repository for speech paper reading☆32Updated 3 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆67Updated last year