begeekmyfriend / WaveRNN
WaveRNN Vocoder + TTS
☆16Updated 4 years ago
Alternatives and similar repositories for WaveRNN:
Users that are interested in WaveRNN are comparing it to the libraries listed below
- Audio LPC (linear prediction code) using mel spectrogram, compatible with LPCNet☆61Updated 3 years ago
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆69Updated 4 years ago
- Chinese Text Normalization and Dataset☆83Updated 2 years ago
- An implementation of FastSpeech2 based on PyTorch.☆52Updated last year
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- Predict prosody labels for Chinese sentences.☆41Updated 2 years ago
- Official PyTorch implementation of Speaker Conditional WaveRNN☆110Updated 2 years ago
- ICASSP 2021 accepted papers on voice conversion (VC)☆18Updated 4 years ago
- PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.☆72Updated 3 years ago
- Streaming attention networks for end-to-end automatic speech recognition☆55Updated 4 years ago
- TTS frontend with BERT and CRF/LSTM (for Tacotron)☆52Updated 4 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- VAE Tacotron 2, an alternative to GST Tacotron☆88Updated last year
- The code for the AISHELL-3 baseline acoustic model☆67Updated 4 years ago
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆42Updated 4 years ago
- C++ implementation of end-to-end TTS which combines Tacotron2 and the LPCNet vocoder.☆32Updated 5 years ago
- Official implementation of "WINVC: One-Shot Voice Conversion with Weight Adaptive Instance Normalization".☆30Updated 3 years ago
- Pitch estimation network (PiENet) for noise-robust neural F0 estimation of speech signals☆50Updated 5 years ago
- This repository provides a multi-mode and multi-speaker expressive speech synthesis framework, including multi-attentive Tacotron, DurIAN…☆74Updated 2 years ago
- Implementation of the AlignTTS☆76Updated last year
- Tacotron2 with Global Style Tokens☆63Updated 5 years ago
- A PyTorch implementation of FB-MelGAN☆89Updated 4 years ago
- The official implementation of VAENAR-TTS, a VAE based non-autoregressive TTS model.☆145Updated 3 years ago
- Includes Basis-MelGAN, MelGAN, HiFi-GAN and Multiband-HiFi-GAN; NHV may be added in the future.☆154Updated 3 years ago