KinglittleQ / GST-TacotronLinks

A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis

☆368

Alternatives and similar repositories for GST-Tacotron

Users that are interested in GST-Tacotron are comparing it to the libraries listed below

Sorting:

jxzhanggg / nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC
☆248Updated 2 years ago
liusongxiang / ppg-vc
PPG-Based Voice Conversion
☆341Updated 3 years ago
bshall / UniversalVocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"
☆239Updated 4 years ago
yistLin / FragmentVC
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
☆202Updated 4 years ago
keonlee9420 / Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
☆191Updated 3 years ago
bshall / ZeroSpeech
VQ-VAE for Acoustic Unit Discovery and Voice Conversion
☆337Updated 2 years ago
Wendison / VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
☆351Updated 3 years ago
syang1993 / gst-tacotron
A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"
☆367Updated 6 years ago
MattShannon / mcd
Mel cepstral distortion (MCD) computations in python.
☆225Updated 8 years ago
tts-tutorial / survey
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
☆369Updated 3 years ago
guanlongzhao / fac-via-ppg
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)
☆143Updated 2 years ago
yanggeng1995 / GAN-TTS
A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS
☆231Updated 5 years ago
nii-yamagishilab / multi-speaker-tacotron
VCTK multi-speaker tacotron for ICASSP 2020
☆266Updated 3 years ago
rishikksh20 / FastSpeech2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
☆229Updated 3 years ago
jjery2243542 / adaptive_voice_conversion
☆477Updated 4 years ago
jinhan / tacotron2-vae
Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"
☆168Updated 2 years ago
janvainer / speedyspeech
☆259Updated 2 years ago
keonlee9420 / Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…
☆326Updated 2 years ago
BogiHsu / Tacotron2-PyTorch
Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
☆148Updated 3 years ago
numediart / EmoV-DB
The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems
☆271Updated last year
yistLin / dvector
Speaker embedding (d-vector) trained with GE2E loss
☆282Updated last year
hujinsen / pytorch-StarGAN-VC
Fully reproduce the paper of StarGAN-VC. Stable training and Better audio quality .
☆248Updated last year
lochenchou / MOSNet
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
☆372Updated last year
rishikksh20 / VocGAN
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
☆320Updated 11 months ago
facebookresearch / speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…
☆407Updated last year
HLTSingapore / Emotional-Speech-Data
This is the GitHub page for publicly available emotional speech data.
☆357Updated 3 years ago
KevinMIN95 / StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
☆249Updated 3 years ago
keonlee9420 / Expressive-FastSpeech2
PyTorch Implementation of Non-autoregressive Expressive (emotional, conversational) TTS based on FastSpeech2, supporting English, Korean,…
☆304Updated 3 years ago
bigpon / vcc20_baseline_cyclevae
Voice Conversion Challenge 2020 CycleVAE baseline system
☆131Updated 4 years ago
KunZhou9646 / emotional-voice-conversion-with-CycleGAN-and-CWT-for-Spectrum-and-F0
This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-…
☆125Updated 4 years ago