xixi219/MOS

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xixi219/MOS)

xixi219 / MOS

The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms.

☆31

Alternatives and similar repositories for MOS

Users that are interested in MOS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

osherbakov / MELPe_fxp
View on GitHub
☆11Mar 23, 2020Updated 6 years ago
MaiAnh871 / melpe
View on GitHub
☆10Apr 20, 2022Updated 4 years ago
sp-uhh / ears_benchmark
View on GitHub
Generation scripts for EARS-WHAM and EARS-Reverb
☆48Jul 4, 2025Updated last year
Levent9 / Zero-shot-FaceVC
View on GitHub
☆19Mar 2, 2024Updated 2 years ago
kjw11 / CSEnet-ASR
View on GitHub
Cross-Speaker Encoding Network for Multi-talker Speech Recognition
☆12Mar 14, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
fakerybakery / OpenF5-TTS
View on GitHub
(WIP) A retrain of F5-TTS on permissively-licensed data
☆14Apr 6, 2025Updated last year
OpenNSP / Hifi-vaegan
View on GitHub
☆47Aug 31, 2024Updated last year
ductuantruong / speaker_age_estimation_ssl_study
View on GitHub
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14Oct 19, 2022Updated 3 years ago
karchkha / MelSpec_GPT_VQVAE
View on GitHub
Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms
☆18Oct 8, 2023Updated 2 years ago
microsoft / Distill-MOS
View on GitHub
Distillation of Self-Supervised Representation-Based Speech Quality Assessment
☆49May 15, 2025Updated last year
lucasnewman / best-rq-pytorch
View on GitHub
Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.
☆135Sep 25, 2023Updated 2 years ago
Takaaki-Saeki / ssl_speech_restoration_v2
View on GitHub
☆17Dec 18, 2023Updated 2 years ago
SonyResearch / VRVQ
View on GitHub
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54May 1, 2025Updated last year
huutuongtu / Lightvoc
View on GitHub
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18May 17, 2024Updated 2 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Miamoto / Conformer-NTM
View on GitHub
☆16Nov 9, 2023Updated 2 years ago
alessandroragano / scoreq
View on GitHub
SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)
☆114Aug 1, 2025Updated 11 months ago
facebookresearch / FlowDec
View on GitHub
An neural full-band audio codec for general audio sampled at 48 kHz with 7.5 kps or 4.5 kbps.
☆212Jun 22, 2026Updated last month
daihuangyu / speex_aec_kf
View on GitHub
speex aec kalman filter
☆15Mar 17, 2024Updated 2 years ago
haloboy777 / wav-to-pcm
View on GitHub
This contains python scripts for converting wav files to pcm data for further processing.
☆12May 26, 2017Updated 9 years ago
MachineLP / text_renderer
View on GitHub
Generate text images for training deep learning ocr model
☆10Oct 22, 2018Updated 7 years ago
YUCHEN005 / UNA-GAN
View on GitHub
Code for paper "Unsupervised Noise adaptation using Data Simulation"
☆14May 16, 2024Updated 2 years ago
soham97 / PAM
View on GitHub
PAM is a no-reference audio quality metric for audio generation tasks
☆77Jul 19, 2024Updated 2 years ago
Liu-Tianchi / Golden-Gemini-for-Speaker-Verification
View on GitHub
Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'
☆15Jan 20, 2025Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
lcn-kul / xls-r-analysis-sqa
View on GitHub
Analysis of XLS-R for Speech Quality Assessment
☆15Feb 10, 2025Updated last year
DanielLin94144 / Test-time-adaptation-ASR-SUTA
View on GitHub
Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…
☆23Apr 1, 2022Updated 4 years ago
milahu / opensubtitles-scraper-new-subs
View on GitHub
temporary files created by opensubtitles-scraper
☆18Feb 3, 2026Updated 5 months ago
YuejieGao / TG-CRITIC
View on GitHub
TG-CRITIC: A TIMBRE-GUIDED MODEL FOR REFERENCE-INDEPENDENT SINGING EVALUATION
☆18May 26, 2023Updated 3 years ago
AI-Hobbyist / StarRail_Voice_Sorting_Scripts
View on GitHub
☆13Apr 26, 2026Updated 2 months ago
SpeechResearch / speechresearch.github.io
View on GitHub
☆44Jun 10, 2024Updated 2 years ago
diggerdu / Bach-audio-super-resolution
View on GitHub
Bach:The learning based model for audio super resolution
☆10Jan 26, 2018Updated 8 years ago
leekh7411 / Audio-Super-Resolution-Python3-TF
View on GitHub
Audio Super Resolution in Python3 with Tensorflow 1.5.0 (ref. https://kuleshov.github.io/audio-super-res/)
☆12Jul 10, 2018Updated 8 years ago
audiolabs / MonteCarloRIRSimulation
View on GitHub
Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python)
☆18Feb 25, 2026Updated 4 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
anton-kashkin / hifi_vc
View on GitHub
☆25Jan 24, 2023Updated 3 years ago
liuhao-lh / SMD
View on GitHub
Pytorch implementation of 'Improving Self-supervised Lightweight Model Learning via Hard-aware Metric Distillation. In ECCV 2022'
☆11Mar 22, 2023Updated 3 years ago
stefantaubert / mel-cepstral-distance
View on GitHub
A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …
☆67Aug 24, 2025Updated 10 months ago
kaistmm / AlignDiT
View on GitHub
[ACM MM 2025] AlignDiT: Multimodal Aligned Diffusion Transformer for Synchronized Speech Generation
☆24Oct 28, 2025Updated 8 months ago
chimechallenge / chime-utils
View on GitHub
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
☆26Feb 25, 2025Updated last year
HauLiang / DAMAS-FISTA-Net
View on GitHub
Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming
☆21Aug 20, 2024Updated last year
zeroone-universe / RealTimeBWE
View on GitHub
Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"
☆41Oct 20, 2025Updated 9 months ago