roedoejet / FastSpeech2

An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"

☆22

Alternatives and similar repositories for FastSpeech2

Users who are interested in FastSpeech2 are comparing it to the repositories listed below.

sos1sos2Sixteen / aishell-3-baseline-fc
The code for aishell-3 baseline acoustic model
☆68 Updated 4 years ago
thuhcsi / SpanPSP
☆76 Updated 3 years ago
MaxMax2016 / Grad-TTS-Chinese
Huawei Grad-TTS for Chinese
☆50 Updated last year
pengzhendong / g2p-mix
Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.
☆105 Updated 3 months ago
hcy71o / TransferTTS
TransferTTS (Zero-Shot learning of VITS)
☆100 Updated 2 years ago
KeSpeech / KeSpeech
The repo provides information about KeSpeech dataset.
☆146 Updated 2 years ago
thuhcsi / LightGrad
☆65 Updated last year
k2-fsa / next-gen-kaldi-wechat
☆38 Updated 11 months ago
audeering / w2v2-age-gender-how-to
How to use our public wav2vec2 age and gender model
☆46 Updated last year
imdanboy / jets
JETS: Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech
☆110 Updated 3 years ago
MagicHub-io / MagicData-RAMC
MagicData-RAMC Dataset and Baseline
☆54 Updated 2 years ago
thuhcsi / Crystal
Crystal - C++ implementation of a unified framework for multilingual TTS synthesis engine with SSML specification as interface.
☆229 Updated 4 years ago
tuanh123789 / AdaSpeech
An implementation of Microsoft's "AdaSpeech: Adaptive Text to Speech for Custom Voice"
☆97 Updated 3 years ago
thuhcsi / FlatTN
Chinese Text Normalization and Dataset
☆84 Updated 3 years ago
pengzhendong / pysilero
Python Wrapper of Silero VAD
☆56 Updated 2 months ago
Jackiexiao / tts-frontend-dataset
TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization
☆99 Updated last year
zycv / OpenSpeaker
OpenSpeaker is a completely independent and open source speaker recognition project. It provides the entire process of speaker recognitio…
☆64 Updated 3 years ago
Zeqiang-Lai / Prosody_Prediction
Predict prosody labels for Chinese sentences.
☆41 Updated 3 years ago
ga642381 / FastSpeech2
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech
☆96 Updated 2 years ago
wenet-e2e / wesep
Target Speaker Extraction Toolkit
☆180 Updated 2 weeks ago
wespeech / awesome-tts
☆21 Updated 3 years ago
KunZhou9646 / Mixed_Emotions
☆120 Updated 2 years ago
xcmyz / FastSpeech2
The Implementation of FastSpeech2 Based on Pytorch.
☆52 Updated 2 years ago
espnet / espnet_tts_frontend
Text frontend for ESPnet tts recipes
☆34 Updated 4 years ago
aizhiqi-work / MM-KWS
Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"
☆33 Updated 2 months ago
RevoSpeechTech / speech-datasets-collection
a curated list of speech datasets (110+ datasets, 75+ easy to download)
☆140 Updated 2 years ago
nwpuaslp / TTS_Course
☆69 Updated 4 years ago
yinruiqing / tiny-transducer
Tiny Transducer: A Highly-Efficient Speech Recognition Model on Edge Devices
☆24 Updated 2 years ago
papercup-open-source / phonological-features
Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"
☆33 Updated 4 years ago
wenet-e2e / opencpop
Opencpop: A High-Quality Open Source Chinese Popular Song Database for Singing Voice Synthesis
☆222 Updated 2 years ago