ttslr/python-MCD

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ttslr/python-MCD)

ttslr / python-MCD

☆49

Alternatives and similar repositories for python-MCD

Users that are interested in python-MCD are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

shackysureshot / Mel-Cepstral-Distortion
View on GitHub
Calculation of MCD (dB) between two speech waveforms
☆57Sep 26, 2020Updated 5 years ago
ajinkyakulkarni14 / ERISHA
View on GitHub
ERISHA is a mulitilingual multispeaker expressive speech synthesis framework. It can transfer the expressivity to the speaker's voice for…
☆44Dec 17, 2020Updated 5 years ago
Lukelluke / MCD-MEL-CEPSTRAL-DISTANCE-MCD-application
View on GitHub
Mel cepstral distortion (MCD) computations in python. Use Merlin toolkit to convert .wav files to .gcm files. Work in all form of .wav fi…
☆22Sep 4, 2020Updated 5 years ago
MattShannon / mcd
View on GitHub
Mel cepstral distortion (MCD) computations in python.
☆231Jun 13, 2017Updated 9 years ago
stefantaubert / mel-cepstral-distance
View on GitHub
A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …
☆67Aug 24, 2025Updated 10 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
bajibabu / postfilt_gan
View on GitHub
This is an implementation of "Generative adversarial network-based postfilter for statistical parametric speech synthesis"
☆16Jun 27, 2018Updated 8 years ago
jgarciapueyo / MelNet-SpeechGeneration
View on GitHub
Implementation of MelNet in PyTorch to generate high-fidelity audio samples
☆25Sep 16, 2020Updated 5 years ago
monglechap / fluenttts
View on GitHub
FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS
☆20Nov 15, 2022Updated 3 years ago
ttslr / MonTTS
View on GitHub
☆16Dec 23, 2021Updated 4 years ago
espnet / espnet_tts_frontend
View on GitHub
Text frontend for ESPnet tts recipes
☆35Jun 1, 2021Updated 5 years ago
caizexin / tf_multispeakerTTS_fc
View on GitHub
the Tensorflow version of multi-speaker TTS training with feedback constraint
☆40Oct 12, 2020Updated 5 years ago
MingjieChen / LowResourceVC
View on GitHub
Voice conversion training with 109 speakers with limited training samples
☆35Dec 21, 2020Updated 5 years ago
KunZhou9646 / controllable_evc_code
View on GitHub
This is the code for controllable EVC framework for seen and unseen emotion generation.
☆45Nov 3, 2021Updated 4 years ago
zhengmidon / singaligner
View on GitHub
a compact audio-to-phoneme aligner for singing voice
☆12Jan 17, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
tarepan / VoiceConversionLab
View on GitHub
Collect Voice Conversion researches
☆97Updated this week
PlayVoice / BigVGAN
View on GitHub
BigVGAN with Neural Source-Filter
☆58Sep 21, 2023Updated 2 years ago
AppleHolic / multiband_melgan
View on GitHub
An unofficial implementation of https://arxiv.org/abs/2005.05106
☆50Mar 10, 2021Updated 5 years ago
ivanvovk / compressed-tacotron2-pytorch
View on GitHub
Compressed version of Tacotron 2 using Tensor Train + Waveglow.
☆22Dec 26, 2019Updated 6 years ago
Takaaki-Saeki / simplified_neural_source_filter
View on GitHub
PyTorch implementation of simplified neural source filter model (s-nsf)
☆14Aug 4, 2021Updated 4 years ago
sony / bigvsan_eval
View on GitHub
Evaluation tool used in the BigVSAN paper
☆14Mar 22, 2024Updated 2 years ago
gudgud96 / piano-synthesis
View on GitHub
Code accompanying ML4MD ICML 2020 paper - "Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance".
☆31Jul 22, 2020Updated 5 years ago
bastibe / MAPS-Scripts
View on GitHub
A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.
☆25Mar 29, 2021Updated 5 years ago
LEEYOONHYUNG / GraphTTS
View on GitHub
☆12Jul 6, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ericwudayi / SkipVQVC
View on GitHub
An implementation of SkipVQVC with various settings.
☆75Jun 22, 2020Updated 6 years ago
KunZhou9646 / emotional-voice-conversion-with-CycleGAN-and-CWT-for-Spectrum-and-F0
View on GitHub
This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-…
☆124Dec 14, 2020Updated 5 years ago
inconnu11 / Objective-evaluation_speech_synthesis
View on GitHub
☆17Mar 24, 2022Updated 4 years ago
KunZhou9646 / Speaker-independent-emotional-voice-conversion-based-on-conditional-VAW-GAN-and-CWT
View on GitHub
This is the implementation of our Interspeech 2020 paper "Converting anyone's emotion: towards speaker-independent emotional voice conver…
☆90Nov 13, 2020Updated 5 years ago
HarunoriKawano / BEST-RQ
View on GitHub
Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.
☆96May 25, 2023Updated 3 years ago
gabrielmittag / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆964Dec 1, 2024Updated last year
ttaoREtw / semi-tts
View on GitHub
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
☆39Jul 16, 2020Updated 6 years ago
line / promptttspp
View on GitHub
PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions
☆86Oct 11, 2024Updated last year
bachhavpramod / bandwidth_extension
View on GitHub
☆57Jul 5, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
BridgetteSong / BunchedLPCnet
View on GitHub
This repository provides UNOFFICIAL Bunched LPCNet implementations with Pytorch.
☆14Jun 17, 2021Updated 5 years ago
ANLGBOY / WaveNODE
View on GitHub
Pytorch Implementation of WaveNODE
☆64Sep 4, 2020Updated 5 years ago
maum-ai / cotatron
View on GitHub
Official code for Cotatron @ INTERSPEECH 2020
☆213Jul 25, 2024Updated last year
sos1sos2Sixteen / aishell-3-baseline-fc
View on GitHub
The code for aishell-3 baseline acoustic model
☆70Nov 30, 2020Updated 5 years ago
thuhcsi / icassp2021-emotion-tts
View on GitHub
Please visit: https://thuhcsi.github.io/icassp2021-emotion-tts/
☆34Mar 17, 2023Updated 3 years ago
rishikksh20 / PPSpeech
View on GitHub
PPSpeech: Phrase based Parallel End-to-End TTS System
☆35Aug 31, 2020Updated 5 years ago
lochenchou / MOSNet
View on GitHub
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
☆380Jul 21, 2024Updated 2 years ago