roedoejet / FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
☆22Updated 2 years ago
Alternatives and similar repositories for FastSpeech2
Users who are interested in FastSpeech2 are comparing it to the libraries listed below.
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Updated 5 years ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆395Updated last year
- ☆21Updated 3 years ago
- A multilingual (97 languages) tool that automatically recognizes and segments mixed-language text content for TTS.☆18Updated last year
- The code for aishell-3 baseline acoustic model☆69Updated 4 years ago
- Forced Alignment-MFA☆46Updated 3 years ago
- ☆49Updated 2 years ago
- ☆76Updated 3 years ago
- Collection of pretrained models for the Montreal Forced Aligner☆163Updated 2 months ago
- Easy-to-Use Speech MOS predictors☆311Updated last year
- Target Speaker Extraction Toolkit☆195Updated last month
- The repo provides information about KeSpeech dataset.☆155Updated 2 years ago
- Huawei Grad-TTS for Chinese☆51Updated last year
- Kaldi-compatible online fbank extractor without external dependencies☆114Updated last week
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆97Updated 2 years ago
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆152Updated 2 years ago
- JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆111Updated 3 years ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆171Updated this week
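- A PyTorch-based VITS-BigVGAN Chinese TTS model with an added prosody prediction model.☆196Updated 2 years ago
- Unofficial implementation of Megatts2☆286Updated last year
- Continued training on the Biaobei (标贝) data, with improvements to the original FastSpeech2 model: prosody representation and prosody prediction modules are introduced to make Chinese pronunciation more vivid and rhythmic.☆270Updated last year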
- Predict prosody labels for Chinese sentences.☆41Updated 3 years ago
- ☆121Updated 2 years ago
- Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis☆226Updated 2 years ago
- AdaSpeech: Adaptive Text to Speech for Custom Voice☆160Updated 4 years ago
- Chinese Text Normalization and Dataset☆85Updated 3 years ago
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆216Updated last year
- personal blog☆17Updated 3 years ago
- UT-Sarulab MOS prediction system using SSL models☆258Updated last year
- Python Wrapper of Silero VAD☆59Updated 3 months ago