dathudeptrai / FastSpeech2
A TensorFlow implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
☆11 Updated 4 years ago
Alternatives and similar repositories for FastSpeech2:
Users who are interested in FastSpeech2 are comparing it to the repositories listed below.
- ICASSP 2021 accepted papers in terms of voice conversion (VC) ☆18 Updated 3 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer ☆19 Updated 4 years ago
- with alignment learning and continuous wavelet transform ☆20 Updated 2 years ago
- RepVgg + HiFiGAN ☆33 Updated 2 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations with PyTorch. ☆14 Updated 3 years ago
- Official implementation of DGP-based multi-speaker speech synthesis with PyTorch ☆24 Updated 3 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration ☆34 Updated 3 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models" ☆46 Updated 2 years ago
- Open Source Speech/Text Data on AI ☆18 Updated 2 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm ☆21 Updated last year
- End-to-end diarization loss ☆22 Updated 3 years ago
- PPSpeech: Phrase based Parallel End-to-End TTS System ☆35 Updated 4 years ago
- Google's TPGST reimplementation. ☆34 Updated 5 years ago
- ☆25 Updated 2 years ago
- new version of tacotron2 (old version: https://github.com/xcmyz/Tacotron2-Pytorch) ☆8 Updated 3 years ago
- ☆36 Updated 2 years ago
- Ultrafast GAN-based Vocoder for Text to Speech ☆50 Updated 2 years ago
- ☆24 Updated 2 years ago
- ☆64 Updated 2 years ago
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems ☆39 Updated last year
- Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023 ☆26 Updated last year
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech ☆22 Updated 2 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech ☆46 Updated 2 years ago
- DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code ☆10 Updated 3 years ago
- An unofficial implementation of https://arxiv.org/abs/2005.05106 ☆46 Updated 4 years ago
- ☆56 Updated 2 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN ☆12 Updated 4 years ago
- ☆20 Updated 2 years ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems. ☆29 Updated last year