MattShannon / mcd

Mel cepstral distortion (MCD) computations in Python.

☆227

Alternatives and similar repositories for mcd

Users who are interested in mcd are comparing it to the libraries listed below.

SamuelBroughton / Mel-Cepstral-Distortion
Calculation of MCD (dB) between two speech waveforms
☆57 Updated 5 years ago
rishikksh20 / Fre-GAN-pytorch
Fre-GAN: Adversarial Frequency-consistent Audio Synthesis
☆108 Updated 4 years ago
ericwudayi / SkipVQVC
An implementation of SkipVQVC with various settings.
☆75 Updated 5 years ago
guanlongzhao / fac-via-ppg
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)
☆148 Updated 2 years ago
jinhan / tacotron2-vae
Implementation of "Learning Latent Representations for Style Control and Transfer in End-to-end Speech Synthesis"
☆169 Updated 2 years ago
jxzhanggg / nonparaSeq2seqVC_code
Implementation code of non-parallel sequence-to-sequence VC
☆248 Updated 2 years ago
keonlee9420 / Parallel-Tacotron2
PyTorch Implementation of Google's Parallel Tacotron 2: A Non-Autoregressive Neural TTS Model with Differentiable Duration Modeling
☆190 Updated 4 years ago
sarulab-speech / UTMOS22
UT-Sarulab MOS prediction system using SSL models
☆283 Updated last year
KinglittleQ / GST-Tacotron
A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis
☆369 Updated 2 years ago
lochenchou / MOSNet
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
☆376 Updated last year
jinhan / tacotron2-gst
Tacotron2 with Global Style Tokens
☆65 Updated 6 years ago
yistLin / FragmentVC
Any-to-any voice conversion by end-to-end extracting and fusing fine-grained voice fragments with attention
☆203 Updated 5 years ago
xcmyz / FastVocoder
Includes Basis-MelGAN, MelGAN, HifiGAN and Multiband-HifiGAN; NHV may be added in the future.
☆157 Updated 4 years ago
YoungSeng / SRD-VC
Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)
☆119 Updated last year
liusongxiang / ppg-vc
PPG-Based Voice Conversion
☆348 Updated 3 years ago
ga642381 / FastSpeech2
Multi-Speaker Pytorch FastSpeech2: Fast and High-Quality End-to-End Text to Speech
☆98 Updated 3 years ago
KrishnaDN / x-vector-pytorch
Implementation of the paper "Spoken Language Recognition using X-vectors" in Pytorch
☆105 Updated 5 years ago
nii-yamagishilab / mos-finetune-ssl
☆108 Updated 2 years ago
cvqluu / GE2E-Loss
Pytorch implementation of Generalized End-to-End Loss for speaker verification
☆87 Updated 6 years ago
rishikksh20 / vae_tacotron2
VAE Tacotron 2, an alternative to GST Tacotron
☆89 Updated 2 years ago
ttslr / python-MCD
☆49 Updated 5 years ago
AndreevP / wvmos
MOS score prediction by fine-tuned wav2vec2.0 model
☆171 Updated 3 years ago
zeroQiaoba / ivector-xvector
Extract xvector and ivector under kaldi
☆110 Updated 7 years ago
idiap / acoustic-simulator
Implementation of audio degradation processes
☆105 Updated 10 years ago
KunZhou9646 / Mixed_Emotions
☆121 Updated 3 years ago
JeremyCCHsu / vae-npvc
Re-implementation of the code used in Voice Conversion from Non-parallel Corpora Using Variational Auto-encoder
☆148 Updated 6 years ago
dipjyoti92 / SC-WaveRNN
Official PyTorch implementation of Speaker Conditional WaveRNN
☆110 Updated 3 years ago
rishikksh20 / hifigan-denoiser
HiFi-GAN: High Fidelity Denoising and Dereverberation Based on Speech Deep Features in Adversarial Networks
☆221 Updated 4 years ago
KunZhou9646 / Emovox
Implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".
☆93 Updated 3 years ago
mycrazycracy / speaker-embedding-with-phonetic-information
The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"
☆45 Updated 6 years ago