SandyPanda-MLDL / -Evaluation-Metrics-Used-For-The-Performance-Evaluation-of-Voice-Conversion-VC-Models
Evaluation Metrics Used For The Performance Evaluation of Voice Conversion (VC) Models
☆19Updated 3 months ago
Alternatives and similar repositories for -Evaluation-Metrics-Used-For-The-Performance-Evaluation-of-Voice-Conversion-VC-Models
Users that are interested in -Evaluation-Metrics-Used-For-The-Performance-Evaluation-of-Voice-Conversion-VC-Models are comparing it to the libraries listed below
- Whisper to Normal Speech Conversion with SC-MelGAN and SC-VQ-VAE☆14Updated 2 years ago
- ☆21Updated 4 years ago
- Nonparallel Emotional Speech Conversion with MUNIT. Introduction: This is a TensorFlow implementation of the paper (https://arxiv.org/pdf/1811…☆15Updated 3 years ago
- Code for Interspeech2022 paper DeID-VC: Speaker De-identification via Zero-shot Pseudo Voice Conversion☆13Updated 2 years ago
- Voice emotion conversion model for a DS/ML master's thesis. F0 contour mapping with a sequence-to-sequence RNN-LSTM architecture in TensorFlow.☆27Updated 6 years ago
- Voice Alignment and Conversion with Neural Networks and the WORLD codec.☆20Updated 6 years ago
- Source code and demo for INTERSPEECH 2023 paper: DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion P…☆37Updated last year
- PyTorch implementation of "F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder"☆29Updated 4 years ago
- This is the implementation of the paper "Emotion Intensity and its Control for Emotional Voice Conversion".☆92Updated 3 years ago
- ☆30Updated 2 years ago
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆57Updated last month
- Speech Representation Disentanglement with Adversarial Mutual Information Learning for One-shot Voice Conversion (Interspeech 2022)☆119Updated last year
- ☆58Updated last year
- [Interspeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition" by …☆39Updated 2 years ago
- This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv…☆21Updated 2 years ago
- ☆35Updated 2 years ago
- ☆78Updated 8 months ago
- A real-time voice conversion model based on VITS.☆11Updated last year
- Speech Human Evaluation Estimation Toolkit (SHEET)☆116Updated last week
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆60Updated 8 months ago
- A PyTorch implementation of StarGAN-VC2☆17Updated 5 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆50Updated 3 months ago
- Any-to-one voice conversion using a data augmentation strategy: pitch is shifted while duration is preserved.☆33Updated 3 years ago
- Voice Conversion method based on speaker style☆14Updated 4 years ago
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆23Updated last year
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆86Updated 2 years ago
- The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…☆64Updated last week
- This GitHub repo is for the NeurIPS 2021 and Interspeech 2022 papers on Non-Matching Reference based speech quality assessment.…☆104Updated 2 years ago
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆34Updated last year
- Official repository for FlowSE (Interspeech 2025)☆49Updated 3 months ago