ORI-Muchim / Grad-TTS
'Grad-TTS' with Multilingual Cleaners
☆10 Updated 5 months ago
Related projects:
- ☆10 Updated last month
- 4 GB GPU & 10 minutes to train ☆12 Updated last year
- Simple inference for VITS2 TTS using ONNX Runtime and espeak-ng in C++ ☆11 Updated 5 months ago
- Aligner for text-to-speech ☆15 Updated 2 months ago
- GPT for FACodec ☆13 Updated 5 months ago
- ☆28 Updated this week
- My vocoder experiments ☆20 Updated last month
- Unofficial PyTorch implementation of VISinger: Variational Inference with Adversarial Learning for End-to-end Singing Voice Synthesis (IC… ☆15 Updated last year
- ☆26 Updated this week
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one ☆27 Updated last month
- This is a TTS model based on VITS that can control the output speech emotion through natural language and control the speaker through ref… ☆4 Updated last month
- Zero-Shot Foreign Accent Conversion without a Native Reference ☆27 Updated 4 months ago
- ☆24 Updated 2 months ago
- End-to-end speech synthesis system with FastSpeech2 & HiFi-GAN ☆13 Updated 2 years ago
- ☆14 Updated 4 months ago
- Torch implementation of Whisper-guided DDPM-based Voice Conversion ☆49 Updated last year
- ☆27 Updated 10 months ago
- ☆27 Updated 6 months ago
- Incorporating AutoVocoder to MB-iSTFT-VITS ☆44 Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆64 Updated 5 months ago
- ☆33 Updated 5 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆16 Updated 2 years ago
- An open-source Kazakh Emotional Text-to-Speech Dataset ☆23 Updated 5 months ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End-to-End Text-to-Speech ☆22 Updated 2 years ago
- ☆14 Updated this week
- Text to speech ☆10 Updated 6 months ago
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11… ☆40 Updated 2 months ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark ☆18 Updated 2 months ago
- MFA acoustic model training based on Opencpop ☆12 Updated last year
- ☆25 Updated last year