dathudeptrai / FastSpeech2
A TensorFlow implementation of FastSpeech 2: Fast and High-Quality End-to-End Text to Speech
☆11 Updated 4 years ago
Related projects
Alternatives and complementary repositories for FastSpeech2
- RepVgg + HiFiGAN ☆33 Updated 2 years ago
- Open Source Speech/Text Data on AI ☆18 Updated 2 years ago
- Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration ☆34 Updated 3 years ago
- This is the source code of the paper "Neural grapheme-to-phoneme conversion with pretrained grapheme models" ☆45 Updated 2 years ago
- This repository provides UNOFFICIAL Bunched LPCNet implementations in PyTorch. ☆14 Updated 3 years ago
- ICASSP 2021 accepted papers on voice conversion (VC) ☆18 Updated 3 years ago
- Singing Voice Speech modeling test ☆35 Updated 2 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech ☆46 Updated 2 years ago
- Pushing the Limits of Zero-shot End-to-End Speech Translation ☆21 Updated 3 months ago
- PPSpeech: Phrase based Parallel End-to-End TTS System ☆35 Updated 4 years ago
- ☆30 Updated last year
- SC-CNN: Effective Speaker Conditioning Method for Zero-Shot Multi-Speaker Text-to-Speech Systems ☆39 Updated last year
- End-to-end speech synthesis system with knowledge distillation ☆16 Updated 2 years ago
- ☆64 Updated 2 years ago
- ☆56 Updated last year
- ☆19 Updated last year
- 60k hours of phoneme-aligned audio from audio books ☆18 Updated 3 months ago
- PyTorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022) ☆54 Updated 8 months ago
- ☆20 Updated 2 years ago
- with alignment learning and continuous wavelet transform ☆19 Updated 2 years ago
- Multiband version of HiFi-GAN ☆17 Updated 4 years ago
- FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS ☆21 Updated 2 years ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm ☆19 Updated last year
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech ☆22 Updated 2 years ago
- ☆25 Updated last year
- SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification ☆30 Updated last year
- This is a template for the Non-autoregressive Deep Learning-Based TTS model (in PyTorch). ☆15 Updated 3 years ago