stefantaubert/mean-opinion-score

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/stefantaubert/mean-opinion-score)

stefantaubert / mean-opinion-score

Python library for calculating the mean opinion score and 95% confidence interval of the standard deviation of text-to-speech ratings according to Ribeiro et al. (2011).

☆24

Alternatives and similar repositories for mean-opinion-score

Users that are interested in mean-opinion-score are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Lukelluke / MCD-MEL-CEPSTRAL-DISTANCE-MCD-application
View on GitHub
Mel cepstral distortion (MCD) computations in python. Use Merlin toolkit to convert .wav files to .gcm files. Work in all form of .wav fi…
☆22Sep 4, 2020Updated 5 years ago
declare-lab / VIP
View on GitHub
Our EMNLP 2022 paper on VIP-Based Prompting for Parameter-Efficient Learning
☆10Oct 22, 2022Updated 3 years ago
JacobLinCool / zero-rvc
View on GitHub
Run Retrieval-based Voice Conversion training and inference with ease.
☆12Jan 24, 2025Updated last year
PlayVoice / VI-SVC
View on GitHub
VI-SVC model is just VITS without MAS and DurationPredictor.
☆10Nov 9, 2023Updated 2 years ago
wavlab-speech / cmu_multilingual_speech
View on GitHub
CMU multilingual speech repository
☆30Apr 15, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
lucasnewman / e2-tts-mlx
View on GitHub
Implementation of E2-TTS, "Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS", in MLX
☆21Oct 8, 2024Updated last year
MaxMax2016 / Glow-SVC
View on GitHub
4G GPU & 10 Minutes for train
☆12Aug 9, 2023Updated 2 years ago
jerryuhoo / VISinger
View on GitHub
Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.
☆39Feb 24, 2023Updated 3 years ago
horvathandris / dime
View on GitHub
An ISO-4217 currency library for Gleam
☆13Jun 30, 2026Updated last week
reppy4620 / x-vits
View on GitHub
☆14Aug 1, 2025Updated 11 months ago
SJTMusicTeam / MusicGeneration
View on GitHub
☆10May 15, 2021Updated 5 years ago
stefantaubert / mel-cepstral-distance
View on GitHub
A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …
☆67Aug 24, 2025Updated 10 months ago
ORI-Muchim / BARK-RVC
View on GitHub
Multilingual-Speech-Synthesis-Voice-Conversion Using Bark + RVC
☆14Apr 19, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
vtuber-plan / FlowVAE
View on GitHub
☆17Dec 12, 2023Updated 2 years ago
google-research-datasets / adversarial-nibbler
View on GitHub
This dataset contains results from all rounds of Adversarial Nibbler. This data includes adversarial prompts fed into public generative t…
☆27Feb 3, 2025Updated last year
andi611 / CS-Tacotron-Pytorch
View on GitHub
Pytorch implementation of CS-Tacotron, a code-switching speech synthesis end-to-end generative TTS model.
☆23Mar 14, 2019Updated 7 years ago
DLSeed / so-vits-svc-5.0
View on GitHub
Sovits5 with RMVPE
☆14Jul 17, 2023Updated 2 years ago
unilight / sheet
View on GitHub
Speech Human Evaluation Estimation Toolkit (SHEET)
☆136Mar 31, 2026Updated 3 months ago
MusicTextSynaesthesia / MusicTextSynaesthesia
View on GitHub
☆10Sep 17, 2022Updated 3 years ago
kdrkdrkdr / RVC-VITS
View on GitHub
Few-shot multilingual tts with RVC and Vits
☆50Jun 15, 2023Updated 3 years ago
mmorise / no7_singing
View on GitHub
☆14Oct 11, 2024Updated last year
datawhalechina / musiclm-universe
View on GitHub
Music Language Model Generation, Optimization, and Practice
☆61Apr 20, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xkx-hub / ISCSLP2024_CoVoC_baseline
View on GitHub
☆13Jun 8, 2024Updated 2 years ago
Ereboas / TacoLM
View on GitHub
☆19May 2, 2024Updated 2 years ago
anonymous84654 / RAVE_anonymous
View on GitHub
☆14Mar 20, 2022Updated 4 years ago
XinhaoMei / audio-text_retrieval
View on GitHub
Implementation of our paper 'On Metric Learning For Audio-Text Cross-Modal Retrieval'
☆51May 17, 2022Updated 4 years ago
ishine / ContextNet
View on GitHub
Tensorflow2 based implementation of ContextNet, an improved convolutional rnn-transducer-based architecture for end-to-end speech recogni…
☆18Oct 19, 2020Updated 5 years ago
bigpon / SpeechSubjectiveTest
View on GitHub
Speech (audio) subjective evaluation system
☆42Jul 15, 2020Updated 5 years ago
X-LANCE / StoryTTS
View on GitHub
[ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations
☆141Apr 27, 2024Updated 2 years ago
wavlab-speech / shinjiwlab.github.io
View on GitHub
☆18Jun 22, 2026Updated 2 weeks ago
JabuMlDev / Speaker-VGG-CCT
View on GitHub
Official implementation of the paper "SPEAKER VGG CCT: Cross-corpus Speech Emotion Recognition with Speaker Embedding and Vision Transfor…
☆25Feb 17, 2023Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
wavlab-speech / versa
View on GitHub
Versatile Evaluation of Speech and Audio
☆419Updated this week
Tayjsl97 / RL-Chord
View on GitHub
This is the official implementation of RL-Chord (TNNLS).
☆13Jan 2, 2024Updated 2 years ago
archont94 / mutable-env
View on GitHub
Docker environment for Mutable Instruments modules hacking
☆11Feb 22, 2023Updated 3 years ago
TomJwYu / WenetSpeechSpeakerCluster
View on GitHub
☆56Jul 17, 2023Updated 2 years ago
microsoft / fadtk
View on GitHub
A simple library for Fréchet Audio Distance (FAD) calculation
☆265Aug 22, 2025Updated 10 months ago
xxayt / MGSV
View on GitHub
[ICCV 2025] This repo is the official implementation of "Music Grounding by Short Video"
☆27Sep 9, 2025Updated 10 months ago
B06901052 / DeepSpeed
View on GitHub
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
☆13Oct 11, 2022Updated 3 years ago