Mel cepstral distortion (MCD) computations in python. Use Merlin toolkit to convert .wav files to .gcm files. Work in all form of .wav files
☆21Sep 4, 2020Updated 5 years ago
Alternatives and similar repositories for MCD-MEL-CEPSTRAL-DISTANCE-MCD-application
Users that are interested in MCD-MEL-CEPSTRAL-DISTANCE-MCD-application are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆17Mar 24, 2022Updated 4 years ago
- ☆49May 3, 2020Updated 5 years ago
- GAN series for voice conversion on VCC2018 dataset☆17Aug 27, 2020Updated 5 years ago
- Calculation of MCD (dB) between two speech waveforms☆57Sep 26, 2020Updated 5 years ago
- Basic Tools☆13Dec 18, 2021Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- the Tensorflow version of multi-speaker TTS training with feedback constraint☆40Oct 12, 2020Updated 5 years ago
- Voice conversion (VC) investigation using three variants of VAE☆59Oct 28, 2019Updated 6 years ago
- Voice conversion training with 109 speakers with limited training samples☆35Dec 21, 2020Updated 5 years ago
- Official implementation of SpeechSplit2☆135Oct 22, 2022Updated 3 years ago
- Mel cepstral distortion (MCD) computations in python.☆230Jun 13, 2017Updated 8 years ago
- An implementation of SkipVQVC with various settings.☆75Jun 22, 2020Updated 5 years ago
- An implement of SPEECHSPLIT☆15Sep 12, 2020Updated 5 years ago
- Transfer Learning from Monolingual ASR to Transcription-free Cross-lingual Voice Conversion☆40Oct 22, 2022Updated 3 years ago
- Python library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings acc…☆24Jan 31, 2025Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The code for aishell-3 baseline acoustic model☆69Nov 30, 2020Updated 5 years ago
- ☆23Dec 10, 2024Updated last year
- A pytorch implementation of StarGAN-VC2☆150Sep 11, 2020Updated 5 years ago
- An unofficial implementation of Vector Quantization Voice Conversion (VQVC).☆29Apr 12, 2021Updated 4 years ago
- Official implementation of "WINVC: One-Shot Voice Conversion with Weight Adaptive Instance Normalization".☆30Nov 13, 2021Updated 4 years ago
- Demo for DART, Audio Imagination workshop submission in NeurIPS 2024☆13Apr 15, 2025Updated 11 months ago
- This is a pytorch implementation of StarGAN-VC2.☆13Dec 17, 2019Updated 6 years ago
- ☆19Feb 28, 2018Updated 8 years ago
- pytorch implementation of "Emotional Voice Conversion using Multitask Learning with Text-to-Speech", Accepted to ICASSP 2020☆30Jul 6, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Evaluation Metrics Used For The Performance Evaluation of Voice Conversion (VC) Models☆19Jul 8, 2025Updated 8 months ago
- Blog of the LibreCV.org☆11May 17, 2021Updated 4 years ago
- Generated Audio Samples by ALGAN-VC model are available in the folder☆19Feb 25, 2022Updated 4 years ago
- ☆10Sep 17, 2022Updated 3 years ago
- A Pytorch implementation of StarGAN-VC2☆17Jul 28, 2020Updated 5 years ago
- Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion☆143Sep 1, 2020Updated 5 years ago
- Voice Conversion pipeline consisting of GE2E speaker encoder, AutoVC conversion model and MelGAN vocoder.☆23Jan 24, 2021Updated 5 years ago
- Creative Adversarial Network for generating Dance Music Rhythm Patterns☆10Nov 25, 2020Updated 5 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Mar 29, 2021Updated 4 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- A speech signal processing library in Python with emphasis on deep learning.☆31Jul 16, 2022Updated 3 years ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆68Jul 5, 2024Updated last year
- This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…☆87Dec 31, 2022Updated 3 years ago
- Emotional Speech Conversion using Nonparallel Data☆17Apr 10, 2019Updated 6 years ago
- An implement of "Phonetic Posteriorgrams based Many-to-Many Singing Voice Conversion via Adversarial Training"☆125Nov 4, 2020Updated 5 years ago
- ☆100Jul 22, 2021Updated 4 years ago
- Run Retrieval-based Voice Conversion training and inference with ease.☆11Jan 24, 2025Updated last year