dathudeptrai / FastSpeech2
A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
☆11Updated 4 years ago
Alternatives and similar repositories for FastSpeech2:
Users that are interested in FastSpeech2 are comparing it to the libraries listed below
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 3 years ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆46Updated 3 years ago
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Updated 4 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆21Updated last year
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Updated 3 years ago
- ☆25Updated 2 years ago
- new version of tacotron2 (old version: https://github.com/xcmyz/Tacotron2-Pytorch)☆8Updated 4 years ago
- RepVgg + HiFiGAN☆34Updated 2 years ago
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)☆54Updated last year
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆22Updated 2 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 4 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems☆39Updated last year
- ☆20Updated 2 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch☆24Updated 4 years ago
- Singing Voice Speech modeling test☆35Updated 2 years ago
- ☆26Updated 2 years ago
- with alignment learning and continuous wavelet transform☆20Updated 2 years ago
- Google's TPGST reimplementation.☆34Updated 5 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631☆23Updated 2 years ago
- ☆19Updated 2 years ago
- ☆12Updated 3 months ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆19Updated 4 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 3 years ago
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification☆30Updated 2 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Updated 4 months ago
- Temporary anonymous version☆22Updated last year
- Curriculum Vitae of Quan Wang☆15Updated 3 months ago