roedoejet / FastSpeech2Links
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
☆22Updated 2 years ago
Alternatives and similar repositories for FastSpeech2
Users that are interested in FastSpeech2 are comparing it to the libraries listed below
Sorting:
- The code for aishell-3 baseline acoustic model☆68Updated 4 years ago
- ☆76Updated 3 years ago
- Huawei Grad-TTS for Chinese☆50Updated last year
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆105Updated 3 months ago
- TransferTTS (Zero-Shot learning of VITS)☆100Updated 2 years ago
- The repo provides information about KeSpeech dataset.☆146Updated 2 years ago
- ☆65Updated last year
- ☆38Updated 11 months ago
- How to use our public wav2vec2 age and gender model☆46Updated last year
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆110Updated 3 years ago
- MagicData-RAMC Dataset and Baseline☆54Updated 2 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Updated 4 years ago
- An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"☆97Updated 3 years ago
- Chinese Text Normalization and Dataset☆84Updated 3 years ago
- Python Wrapper of Silero VAD☆56Updated 2 months ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆99Updated last year
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆64Updated 3 years ago
- Predict prosody labels for Chinese sentences.☆41Updated 3 years ago
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆96Updated 2 years ago
- Target Speaker Extraction Toolkit☆180Updated 2 weeks ago
- ☆21Updated 3 years ago
- ☆120Updated 2 years ago
- The Implementation of FastSpeech2 Based on Pytorch.☆52Updated 2 years ago
- Text frontend for ESPnet tts recipes☆34Updated 4 years ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆33Updated 2 months ago
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆140Updated 2 years ago
- ☆69Updated 4 years ago
- Tiny Transducer: A Highly-Efficient Speech Recognition Model on Edge Devices☆24Updated 2 years ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆33Updated 4 years ago
- Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis☆222Updated 2 years ago