roedoejet / FastSpeech2Links
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
☆22Updated 2 years ago
Alternatives and similar repositories for FastSpeech2
Users that are interested in FastSpeech2 are comparing it to the libraries listed below
Sorting:
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Updated 5 years ago
- ☆22Updated 3 years ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆412Updated last month
- Target Speaker Extraction Toolkit☆241Updated 3 months ago
- The code for aishell-3 baseline acoustic model☆69Updated 5 years ago
- Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis☆232Updated last month
- Collection of pretrained models for the Montreal Forced Aligner☆184Updated 3 months ago
- The repo provides information about KeSpeech dataset.☆170Updated 3 years ago
- It's a repository for implementations of neural speech editing algorithms.☆201Updated 2 years ago
- Unoffical implementation of Megatts2☆287Updated last year
- PPG-Based Voice Conversion☆347Updated 3 years ago
- Forced Alignment-MFA☆50Updated 3 years ago
- ☆76Updated 3 years ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆194Updated 3 years ago
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆119Updated last year
- Easy-to-Use Speech MOS predictors☆340Updated 2 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Updated 4 years ago
- Huawei Grad-TTS for Chinese☆50Updated 2 years ago
- Some basic praat scripts.☆226Updated last year
- 基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。☆197Updated 3 years ago
- AdaSpeech: Adaptive Text to Speech for Custom Voice☆162Updated 4 years ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆212Updated 5 months ago
- ☆121Updated 3 years ago
- Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech☆98Updated 3 years ago
- tacotron+griffin Lim synthetic mandarin voice☆26Updated 2 years ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆182Updated 4 months ago
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆270Updated 6 months ago
- ☆257Updated 2 years ago
- UT-Sarulab MOS prediction system using SSL models☆294Updated last year
- FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music gener…☆440Updated last year