nii-yamagishilab/mos-finetune-ssl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nii-yamagishilab/mos-finetune-ssl)

nii-yamagishilab / mos-finetune-ssl

☆112

Alternatives and similar repositories for mos-finetune-ssl

Users that are interested in mos-finetune-ssl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

dhimasryan / MOSA-Net-Cross-Domain
View on GitHub
☆63May 31, 2024Updated 2 years ago
sarulab-speech / UTMOS22
View on GitHub
UT-Sarulab MOS prediction system using SSL models
☆309Apr 11, 2024Updated 2 years ago
unilight / LDNet
View on GitHub
Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"
☆68Dec 13, 2021Updated 4 years ago
unilight / sheet
View on GitHub
Speech Human Evaluation Estimation Toolkit (SHEET)
☆138Mar 31, 2026Updated 3 months ago
dhimasryan / TMHINT-QI-VoiceMOS2023
View on GitHub
☆17Oct 18, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
sky1456723 / Pytorch-MBNet
View on GitHub
A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK
☆62Sep 24, 2021Updated 4 years ago
NKU-HLT / MusicEval-baseline
View on GitHub
☆12Apr 18, 2025Updated last year
tarepan / SpeechMOS
View on GitHub
Easy-to-Use Speech MOS predictors
☆362Oct 24, 2023Updated 2 years ago
dhimasryan / SpeechAssessmentModels
View on GitHub
☆22Jan 18, 2024Updated 2 years ago
soham97 / PAM
View on GitHub
PAM is a no-reference audio quality metric for audio generation tasks
☆77Jul 19, 2024Updated 2 years ago
lochenchou / MOSNet
View on GitHub
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
☆379Jul 21, 2024Updated 2 years ago
sarulab-speech / UTMOSv2
View on GitHub
UTokyo-SaruLab MOS Prediction System
☆357Apr 2, 2026Updated 3 months ago
yangdongchao / ALMTokenizer
View on GitHub
The demo page for ALMTokenizer
☆59Apr 14, 2025Updated last year
ex3ndr / supervoice-hybrid
View on GitHub
My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26Aug 5, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dyahayumgw / HAAQI-Net
View on GitHub
HAAQI-Net is a novel DNN-based non-intrusive method for assessing music audio quality in hearing aid users.
☆18Sep 26, 2025Updated 10 months ago
line / LibriTTS-P
View on GitHub
LibriTTS-P: A Corpus with Speaking Style and Speaker Identity Prompts for Text-to-Speech and Style Captioning
☆161Jun 13, 2024Updated 2 years ago
unilight / s3prl-vc
View on GitHub
S3PRL-VC: A Voice Conversion Toolkit based on S3PRL
☆101Mar 15, 2026Updated 4 months ago
voidful / Codec-SUPERB
View on GitHub
Audio Codec Speech processing Universal PERformance Benchmark
☆308Jul 4, 2026Updated 3 weeks ago
youngsheen / GPST
View on GitHub
[ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer
☆70Nov 1, 2024Updated last year
soumimaiti / speechlmscore_tool
View on GitHub
☆34Nov 24, 2024Updated last year
NTIA / alignnet
View on GitHub
Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.
☆18Aug 1, 2025Updated 11 months ago
Takaaki-Saeki / DiscreteSpeechMetrics
View on GitHub
Reference-aware automatic speech evaluation toolkit
☆185Dec 5, 2024Updated last year
ex3ndr / supervoice-enhance
View on GitHub
Supervoice diffusion enhance
☆28Jul 15, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
hhguo / SoCodec
View on GitHub
Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications
☆92Dec 20, 2024Updated last year
khhungg / BSSE-SE
View on GitHub
Boosting Self-Supervised Embeddings for Speech Enhancement
☆47Jun 23, 2022Updated 4 years ago
gabrielmittag / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆963Dec 1, 2024Updated last year
ASLP-lab / Automatic-Song-Aesthetics-Evaluation-Challenge
View on GitHub
☆15Dec 14, 2025Updated 7 months ago
aliutkus / speechmetrics
View on GitHub
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
☆1,050Jul 5, 2023Updated 3 years ago
yangdongchao / SimpleSpeech
View on GitHub
The open source code for SimpleSpeech series
☆147Oct 8, 2024Updated last year
freds0 / CML-TTS-Dataset
View on GitHub
CML-TTS: A Multilingual Dataset for Speech Synthesis
☆36Jul 31, 2024Updated last year
walker-hyf / GPT-Talker
View on GitHub
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆78Nov 1, 2024Updated last year
3loi / NaturalVoices
View on GitHub
☆61Oct 22, 2025Updated 9 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
AndreevP / wvmos
View on GitHub
MOS score prediction by fine-tuned wav2vec2.0 model
☆180Oct 20, 2022Updated 3 years ago
p0p4k / pflowtts_pytorch
View on GitHub
Unofficial implementation of NVIDIA P-Flow TTS paper
☆228Dec 24, 2024Updated last year
ajd12342 / paraspeechcaps
View on GitHub
Codebase for 'Scaling Rich Style-Prompted Text-to-Speech Datasets'
☆165Mar 26, 2026Updated 4 months ago
wavlab-speech / versa
View on GitHub
Versatile Evaluation of Speech and Audio
☆425Jul 21, 2026Updated last week
walker-hyf / NCSSD
View on GitHub
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆61Nov 1, 2024Updated last year
facebookresearch / Noresqa
View on GitHub
This github repo is for Neurips 2021 and Interspeech 2022 papers on Non-Matching Reference based estimation of speech quality assessment.…
☆104May 9, 2023Updated 3 years ago
lifeiteng / naturalspeech3_facodec
View on GitHub
FACodec: Speech Codec with Attribute Factorization used for NaturalSpeech 3
☆254Apr 20, 2024Updated 2 years ago