dathudeptrai / FastSpeech2
A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
☆11Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for FastSpeech2
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration☆34Updated 3 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models☆45Updated 2 years ago
- Open Source Speech/Text Data on AI☆18Updated 2 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.☆14Updated 3 years ago
- ICASSP 2021 accepted papers in term of voice conversion (VC)☆18Updated 3 years ago
- 60k hours of phoneme-aligned audio from audio books☆18Updated 3 months ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Updated 10 months ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆21Updated 2 months ago
- new version of tacotron2 (old version: https://github.com/xcmyz/Tacotron2-Pytorch)☆8Updated 3 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- WIP Tensorflow implementation of https://github.com/mozilla/TTS☆15Updated 4 years ago
- ☆19Updated last year
- RepVgg + HiFiGAN☆33Updated 2 years ago
- ☆15Updated 3 years ago
- Singing Voice Speech modeling test☆35Updated 2 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System☆35Updated 4 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆22Updated 2 years ago
- with alignment learning and continuous wavelet transform☆19Updated 2 years ago
- Google's TPGST reimplementation.☆34Updated 4 years ago
- ☆64Updated 2 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆19Updated last year
- ☆25Updated last year
- End-to-end Text-to-Speech with Generative Adversarial Networks☆20Updated 3 years ago
- This is an unofficial implementation of universal melgan according to https://arxiv.org/abs/2011.09631☆23Updated 2 years ago
- ☆13Updated 2 years ago