erogol / ParallelWaveGAN
ParallelWaveGAN adaptation for Mozilla TTS
☆15Updated 5 years ago
Alternatives and similar repositories for ParallelWaveGAN
Users who are interested in ParallelWaveGAN are comparing it to the libraries listed below.
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- ERISHA is a multilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…☆43Updated 4 years ago
- asr2k☆51Updated last year
- ☆18Updated 3 years ago
- Official PyTorch implementation of TTS Style Transfer☆24Updated 3 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆54Updated 2 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆31Updated 3 years ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling☆37Updated 4 years ago
- ☆56Updated 6 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 4 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆33Updated 11 months ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Updated 3 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 3 years ago
- Sound examples for the Neural Parametric Singing Synthesizer (NPSS)☆22Updated 3 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated 2 years ago
- ☆32Updated 3 years ago
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Updated 4 years ago
- Text to Speech Synthesis based on controllable latent representation☆14Updated 5 years ago
- PyTorch implementation of DeepMind's WaveRNN model☆121Updated 6 years ago
- Feature extractor for DL speech processing.☆66Updated 3 years ago
- Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC).☆43Updated 3 years ago
- RawNet: Fast End-to-End Neural Vocoder☆42Updated 6 years ago
- An implementation of Tacotron and Tacotron2☆81Updated 3 years ago
- ☆56Updated 2 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106☆46Updated 4 years ago
- Follows NVIDIA's version, simplifies it, and adds data-parallel support.☆13Updated 5 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated 2 years ago
- Dataset release for Emotional TTS in Indian Accent☆40Updated 2 years ago