inconnu11/Objective-evaluation_speech_synthesis

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/inconnu11/Objective-evaluation_speech_synthesis)

inconnu11 / Objective-evaluation_speech_synthesis

☆17

Alternatives and similar repositories for Objective-evaluation_speech_synthesis

Users that are interested in Objective-evaluation_speech_synthesis are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bastibe / MAPS-Scripts
View on GitHub
A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.
☆25Mar 29, 2021Updated 5 years ago
SandyPanda-MLDL / ALGAN-VC-Generated-Audio-Samples
View on GitHub
Generated Audio Samples by ALGAN-VC model are available in the folder
☆19Feb 25, 2022Updated 4 years ago
MingjieChen / VoiceConversionGANs
View on GitHub
GAN series for voice conversion on VCC2018 dataset
☆17Aug 27, 2020Updated 5 years ago
Lukelluke / MCD-MEL-CEPSTRAL-DISTANCE-MCD-application
View on GitHub
Mel cepstral distortion (MCD) computations in python. Use Merlin toolkit to convert .wav files to .gcm files. Work in all form of .wav fi…
☆22Sep 4, 2020Updated 5 years ago
KunZhou9646 / controllable_evc_code
View on GitHub
This is the code for controllable EVC framework for seen and unseen emotion generation.
☆45Nov 3, 2021Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
MU94W / TTS-Eval
View on GitHub
☆18Aug 9, 2018Updated 7 years ago
MingjieChen / LowResourceVC
View on GitHub
Voice conversion training with 109 speakers with limited training samples
☆35Dec 21, 2020Updated 5 years ago
stefantaubert / mel-cepstral-distance
View on GitHub
A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …
☆67Aug 24, 2025Updated 11 months ago
WangHelin1997 / Automatic_Speech_Annotator
View on GitHub
Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…
☆33Jun 14, 2024Updated 2 years ago
qiuqiangkong / sampleRNN_acoustic_scene_generation
View on GitHub
☆14Apr 18, 2019Updated 7 years ago
dipjyoti92 / StarGAN-Voice-Conversion-2
View on GitHub
A Pytorch implementation of StarGAN-VC2
☆17Jul 28, 2020Updated 5 years ago
sony / bigvsan_eval
View on GitHub
Evaluation tool used in the BigVSAN paper
☆14Mar 22, 2024Updated 2 years ago
suhitaghosh10 / emo-stargan
View on GitHub
Implementation of Emo-StarGAN
☆48Dec 19, 2023Updated 2 years ago
tarepan / VoiceConversionLab
View on GitHub
Collect Voice Conversion researches
☆97Updated this week
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
KunZhou9646 / seq2seq-EVC
View on GitHub
This is the implementation of our Interspeech 2021 paper: Limited data emotional voice conversion leveraging text-to-speech: two-stage se…
☆87Dec 31, 2022Updated 3 years ago
erictarrbelmont / AET5895-SP21
View on GitHub
Repository of course materials for AET4220/5895 Audio Software Development Spring 2021
☆13Apr 9, 2021Updated 5 years ago
rhoposit / icassp2021
View on GitHub
☆15May 8, 2021Updated 5 years ago
jefflai108 / Unsupervised-TTS
View on GitHub
☆42Mar 25, 2022Updated 4 years ago
alpoktem / Prosograph
View on GitHub
A Visualizer for prosodically annotated speech corpora
☆12Oct 27, 2021Updated 4 years ago
ssmlkl / MnTTS2
View on GitHub
This is the experimental description of MnTTS2.
☆12Apr 11, 2024Updated 2 years ago
MrZhengXin / multi_intent_2022
View on GitHub
☆13Feb 15, 2023Updated 3 years ago
shackysureshot / Mel-Cepstral-Distortion
View on GitHub
Calculation of MCD (dB) between two speech waveforms
☆57Sep 26, 2020Updated 5 years ago
chaitanya100100 / Relative-Attributes-Zero-Shot-Learning
View on GitHub
Python Implementation of Visual Relative Attributes for Image Classification and Zero Shot Learning
☆22Jun 14, 2018Updated 8 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
SandyPanda-MLDL / -Evaluation-Metrics-Used-For-The-Performance-Evaluation-of-Voice-Conversion-VC-Models
View on GitHub
Evaluation Metrics Used For The Performance Evaluation of Voice Conversion (VC) Models
☆19Jul 8, 2025Updated last year
xinshengwang / ICASSP2021_paper_list-VC
View on GitHub
ICASSP 2021 accepted papers in term of voice conversion (VC)
☆18Apr 11, 2021Updated 5 years ago
monglechap / fluenttts
View on GitHub
FluentTTS: Text-dependent Fine-grained Style Control for Multi-style TTS
☆20Nov 15, 2022Updated 3 years ago
caizexin / tf_multispeakerTTS_fc
View on GitHub
the Tensorflow version of multi-speaker TTS training with feedback constraint
☆40Oct 12, 2020Updated 5 years ago
cnaigithub / SpeechDewarping
View on GitHub
Official implementation of "Unsupervised Pre-training for Data-Efficient Text-to-Speech on Low Resource Languages", ICASSP 2023
☆27Apr 27, 2023Updated 3 years ago
ubisoft / ubisoft-laforge-daft-exprt
View on GitHub
Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis
☆127Apr 8, 2023Updated 3 years ago
qiuqiangkong / dcase2019_task1
View on GitHub
☆20May 13, 2019Updated 7 years ago
honkkis / neural-amp-modeler-HD
View on GitHub
High Definition Neural network emulator for guitar amplifiers.
☆15Feb 9, 2025Updated last year
biggytruck / SpeechSplit2
View on GitHub
Official implementation of SpeechSplit2
☆135Oct 22, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
Rongjiehuang / GenerSpeech
View on GitHub
PyTorch Implementation of GenerSpeech (NeurIPS'22): a text-to-speech model towards zero-shot style transfer of OOD custom voice.
☆333Feb 9, 2024Updated 2 years ago
ga642381 / RobustVC
View on GitHub
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…
☆24Sep 27, 2022Updated 3 years ago
HidekiKawahara / worldGUItools
View on GitHub
GUI tools for WORLD vocoder
☆22Dec 19, 2024Updated last year
bottlecapper / EmoCycleGAN
View on GitHub
Emotional Speech Conversion using Nonparallel Data
☆17Apr 10, 2019Updated 7 years ago
jasonppy / FaST-VGS-Family
View on GitHub
Transformer-based visually grounded speech models
☆19Sep 22, 2022Updated 3 years ago
KunZhou9646 / emotional-voice-conversion-with-CycleGAN-and-CWT-for-Spectrum-and-F0
View on GitHub
This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-…
☆124Dec 14, 2020Updated 5 years ago
bshall / VectorQuantizedCPC
View on GitHub
Vector-Quantized Contrastive Predictive Coding for Acoustic Unit Discovery and Voice Conversion
☆142Sep 1, 2020Updated 5 years ago