The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms.
☆31Feb 16, 2024Updated 2 years ago
Alternatives and similar repositories for MOS
Users that are interested in MOS are comparing it to the libraries listed below
Sorting:
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆15Aug 20, 2024Updated last year
- ☆12Mar 23, 2020Updated 5 years ago
- speex aec kalman filter☆15Mar 17, 2024Updated last year
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- ☆10Apr 20, 2022Updated 3 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆41Oct 20, 2025Updated 4 months ago
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Apr 6, 2025Updated 10 months ago
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- ☆21Jul 29, 2024Updated last year
- ☆19Mar 2, 2024Updated 2 years ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- ☆47Aug 31, 2024Updated last year
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆20Apr 1, 2022Updated 3 years ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆107Aug 1, 2025Updated 7 months ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- ☆25Jan 24, 2023Updated 3 years ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz☆24Jan 2, 2024Updated 2 years ago
- ☆26Nov 2, 2022Updated 3 years ago
- Keyword Spotting suitable for embedded devices.☆28Jun 22, 2020Updated 5 years ago
- ☆26Mar 20, 2024Updated last year
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆64Aug 24, 2025Updated 6 months ago
- EAQUAL stands for Evaluation Of Audio Quality. It's an objective measurement technique used to measure the quality of encoded/decoded aud…☆25Dec 21, 2017Updated 8 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Jun 16, 2022Updated 3 years ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Aug 2, 2025Updated 7 months ago
- ☆28Oct 7, 2025Updated 4 months ago
- PAM is a no-reference audio quality metric for audio generation tasks☆77Jul 19, 2024Updated last year
- Generation scripts for EARS-WHAM and EARS-Reverb☆42Jul 4, 2025Updated 8 months ago
- ☆28Dec 14, 2021Updated 4 years ago
- Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.☆132Sep 25, 2023Updated 2 years ago
- Unofficial SoundStream implementation of Pytorch with training code and 16kHz pretrained checkpoint☆78Feb 9, 2026Updated 3 weeks ago
- ☆30Jun 12, 2025Updated 8 months ago
- Test Framework for few-shot open set KWS☆41Nov 8, 2024Updated last year
- Bandwidth Extension of Historical Recordings using Generative Adversarial Networks☆35May 25, 2023Updated 2 years ago
- Compute distribution-based quality metrics for audio data using embeddings, with a focus on music.☆43Jan 15, 2026Updated last month
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 7 months ago
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation☆254Sep 13, 2024Updated last year