roedoejet / FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
☆22 Updated last year
Alternatives and similar repositories for FastSpeech2:
Users who are interested in FastSpeech2 are comparing it to the libraries listed below.
- Chinese and English Bilingual G2P ☆20 Updated last year
- ☆74 Updated 2 years ago
- Predict prosody labels for Chinese sentences. ☆40 Updated 2 years ago
- ☆65 Updated last year
- The code for aishell-3 baseline acoustic model ☆67 Updated 4 years ago
- ☆69 Updated 4 years ago
- Huawei Grad-TTS for Chinese ☆46 Updated last year
- MagicData-RAMC Dataset and Baseline ☆52 Updated 2 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2 ☆51 Updated last year
- Target Speaker Extraction Toolkit ☆141 Updated this week
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization ☆93 Updated 11 months ago
- ☆32 Updated 2 years ago
- ☆63 Updated 4 months ago
- Chinese Text Normalization and Dataset ☆81 Updated 2 years ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth… ☆82 Updated 2 years ago
- ☆48 Updated 2 months ago
- ☆88 Updated last year
- ☆42 Updated 4 years ago
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",… ☆79 Updated last year
- How to use our public wav2vec2 age and gender model ☆35 Updated last year
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods. ☆35 Updated 2 years ago
- Computes the Mel-Cepstral Distance of two WAV files based on the paper "Mel-Cepstral Distance Measure for Objective Speech Quality Assess… ☆50 Updated last month
- TTS-frontend with Bert and CRF/lstm (For Tacotron) ☆52 Updated 4 years ago
- TransferTTS (Zero-Shot learning of VITS) ☆94 Updated 2 years ago
- Went online decode demo ☆29 Updated 3 years ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer ☆75 Updated last year
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM ☆39 Updated 2 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆121 Updated 2 years ago
- kaldi cnn-tdnnf baseline ☆13 Updated 3 years ago
- Voice activity detection (VAD) papers and code (from 198*~) and their classification. ☆93 Updated 11 months ago