roedoejet / FastSpeech2Links
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
☆22Updated 2 years ago
Alternatives and similar repositories for FastSpeech2
Users that are interested in FastSpeech2 are comparing it to the libraries listed below
Sorting:
- The code for aishell-3 baseline acoustic model☆69Updated 5 years ago
- Huawei Grad-TTS for Chinese☆49Updated 2 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Updated 5 years ago
- Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis☆229Updated 2 years ago
- The repo provides information about KeSpeech dataset.☆164Updated 3 years ago
- ☆76Updated 3 years ago
- Forced Alignment-MFA☆49Updated 3 years ago
- ☆22Updated 3 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Updated 4 years ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆409Updated 2 weeks ago
- Python Wrapper of Silero VAD☆62Updated 6 months ago
- Collection of pretrained models for the Montreal Forced Aligner☆178Updated last month
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆110Updated 8 months ago
- Target Speaker Extraction Toolkit☆226Updated 2 months ago
- Chinese Text Normalization and Dataset☆86Updated 3 years ago
- 基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。☆196Updated 3 years ago
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆98Updated 3 years ago
- It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool.它是一个TTS多语言(97种语言)的混合文本内容自动识别和拆分工具。☆19Updated last year
- Unoffical implementation of Megatts2☆287Updated last year
- Predict prosody labels for Chinese sentences.☆41Updated 3 years ago
- MagicData-RAMC Dataset and Baseline☆56Updated 3 years ago
- Chinese and English Bilinguish G2P☆22Updated 2 years ago
- TransferTTS (Zero-Shot learning of VITS)☆101Updated 3 years ago
- ☆70Updated 5 years ago
- personal blog☆17Updated 3 years ago
- ☆68Updated 2 years ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆177Updated 3 months ago
- TTS-frontend with Bert and CRF/lstm (For Tacotron)☆53Updated 5 years ago
- It's a repository for implementations of neural speech editing algorithms.☆200Updated last year
- tacotron-2(pytorch) + melgan(pytorch) chinese TTS☆26Updated 2 years ago