bshall / ZeroSpeechLinks

VQ-VAE for Acoustic Unit Discovery and Voice Conversion

☆340

Alternatives and similar repositories for ZeroSpeech

Users that are interested in ZeroSpeech are comparing it to the libraries listed below

Sorting:

bshall / UniversalVocoding
A PyTorch implementation of "Robust Universal Neural Vocoding"
☆238Updated 5 years ago
rishikksh20 / VocGAN
VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network
☆321Updated last year
yistLin / FragmentVC
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
☆203Updated 5 years ago
jxzhanggg / nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC
☆248Updated 2 years ago
Wendison / VQMIVC
Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!
☆356Updated 3 years ago
KinglittleQ / GST-Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
☆369Updated 2 years ago
lmnt-com / wavegrad
A fast, high-quality neural vocoder.
☆294Updated 2 years ago
ivanvovk / WaveGrad
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
☆404Updated 4 years ago
facebookresearch / speech-resynthesis
An official reimplementation of the method described in the INTERSPEECH 2021 paper - Speech Resynthesis from Discrete Disentangled Self-S…
☆413Updated 2 years ago
rishikksh20 / FastSpeech2
PyTorch Implementation of FastSpeech 2 : Fast and High-Quality End-to-End Text to Speech
☆232Updated 3 years ago
yanggeng1995 / GAN-TTS
A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS
☆232Updated 5 years ago
numediart / EmoV-DB
The Emotional Voices Database: Towards Controlling the Emotional Expressiveness in Voice Generation Systems
☆276Updated 2 years ago
keonlee9420 / Comprehensive-Transformer-TTS
A Non-Autoregressive Transformer based Text-to-Speech, supporting a family of SOTA transformers with supervised and unsupervised duration…
☆326Updated 3 years ago
nii-yamagishilab / multi-speaker-tacotron
VCTK multi-speaker tacotron for ICASSP 2020
☆266Updated 3 years ago
KevinMIN95 / StyleSpeech
Official implementation of Meta-StyleSpeech and StyleSpeech
☆252Updated 3 years ago
facebookresearch / vocoder-benchmark
A repository for benchmarking neural vocoders by their quality and speed.
☆212Updated 6 months ago
auspicious3000 / SpeechSplit
Unsupervised Speech Decomposition Via Triple Information Bottleneck
☆696Updated last year
bigpon / vcc20_baseline_cyclevae
Voice Conversion Challenge 2020 CycleVAE baseline system
☆131Updated 5 years ago
jinhan / tacotron2-vae
Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"
☆169Updated 2 years ago
BogiHsu / Tacotron2-PyTorch
Yet another PyTorch implementation of Tacotron 2 with reduction factor and faster training speed.
☆148Updated 3 years ago
k2kobayashi / crank
A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder
☆171Updated last year
liusongxiang / ppg-vc
PPG-Based Voice Conversion
☆348Updated 3 years ago
jjery2243542 / adaptive_voice_conversion
☆479Updated 5 years ago
tts-tutorial / survey
A Survey on Neural Speech Synthesis https://arxiv.org/pdf/2106.15561.pdf
☆371Updated 4 years ago
yistLin / dvector
Speaker embedding (d-vector) trained with GE2E loss
☆286Updated last year
facebookresearch / WavAugment
A library for speech data augmentation in time-domain
☆678Updated 4 years ago
keonlee9420 / Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
☆190Updated 4 years ago
guanlongzhao / fac-via-ppg
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)
☆148Updated 2 years ago
maum-ai / assem-vc
Official Code for Assem-VC @ICASSP2022
☆269Updated 3 years ago
maum-ai / cotatron
Official code for Cotatron @ INTERSPEECH 2020
☆214Updated last year