roedoejet / FastSpeech2Links
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
☆22Updated 2 years ago
Alternatives and similar repositories for FastSpeech2
Users that are interested in FastSpeech2 are comparing it to the libraries listed below
Sorting:
- Target Speaker Extraction Toolkit☆204Updated 3 weeks ago
- Production First and Production Ready End-to-End Text-to-Speech Toolkit☆397Updated last year
- The repo provides information about KeSpeech dataset.☆157Updated 3 years ago
- 基于PyTorch的VITS-BigVGAN的tts中文模型,加入韵律预测模型。☆196Updated 3 years ago
- ☆21Updated 3 years ago
- Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis☆227Updated 2 years ago
- Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.☆229Updated 5 years ago
- Speaker verification using ResnetSE (EER=0.0093) and ECAPA-TDNN☆96Updated 4 years ago
- PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised T…☆194Updated 2 years ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P…☆206Updated 2 months ago
- Collection of pretrained models for the Montreal Forced Aligner☆172Updated 3 weeks ago
- Unoffical implementation of Megatts2☆287Updated last year
- Huawei Grad-TTS for Chinese☆51Updated 2 years ago
- Easy-to-Use Speech MOS predictors☆320Updated 2 years ago
- It's a repository for implementations of neural speech editing algorithms.☆200Updated last year
- Python Wrapper of Silero VAD☆60Updated 5 months ago
- An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement☆175Updated last month
- PPG-Based Voice Conversion☆348Updated 3 years ago
- The project is associated with the recently-launched ICASSP 2022 Multi-channel Multi-party Meeting Transcription Challenge (M2MeT) to pro…☆129Updated 3 years ago
- 基于标贝数据继续训练,同时对原本的FastSpeech2模型做了改进,引入了韵律表征以及韵律预测模块,使中文发音更生动且富有节奏☆271Updated 2 years ago
- OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…☆66Updated 3 years ago
- ☆256Updated 2 years ago
- ☆49Updated 2 years ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆110Updated 7 months ago
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform☆263Updated 3 months ago
- The code for aishell-3 baseline acoustic model☆69Updated 4 years ago
- ☆76Updated 3 years ago
- FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3☆218Updated last year
- Some basic praat scripts.☆221Updated last year
- UT-Sarulab MOS prediction system using SSL models☆274Updated last year