roedoejet / FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
☆22 · Updated last year
Related projects
Alternatives and complementary repositories for FastSpeech2
- Predict prosody labels for Chinese sentences. ☆40 · Updated 2 years ago
- Code for the AISHELL-3 baseline acoustic model ☆68 · Updated 3 years ago
- Huawei Grad-TTS for Chinese ☆45 · Updated last year
- ☆74 · Updated 2 years ago
- ☆65 · Updated last year
- An implementation of FastSpeech2 based on PyTorch. ☆52 · Updated last year
- TransferTTS (Zero-Shot learning of VITS) ☆90 · Updated 2 years ago
- TTS-frontend with BERT and CRF/LSTM (for Tacotron) ☆50 · Updated 4 years ago
- Target Speaker Extraction Toolkit ☆112 · Updated 2 weeks ago
- MagicData-RAMC Dataset and Baseline ☆50 · Updated 2 years ago
- ☆16 · Updated 2 years ago
- ☆69 · Updated 3 years ago
- Chinese Text Normalization and Dataset ☆81 · Updated 2 years ago
- Implementation of TTS combining Tacotron2 and HiFi-GAN ☆9 · Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth… ☆81 · Updated last year
- Chinese and English Bilingual G2P ☆20 · Updated last year
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ☆112 · Updated 2 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization ☆84 · Updated 9 months ago
- An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice" ☆96 · Updated 2 years ago
- Prosody Predict ☆10 · Updated 3 years ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis" ☆32 · Updated 4 years ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English ☆74 · Updated this week
- ☆34 · Updated 3 years ago
- The official implementation of EmoSphere-TTS ☆85 · Updated 3 months ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022) ☆112 · Updated 9 months ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods. ☆35 · Updated 2 years ago
- ☆109 · Updated 2 years ago
- ☆47 · Updated 2 weeks ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM ☆39 · Updated 2 years ago
- ☆55 · Updated last year