The MOS system combines components from DNSMOS, NISQA, MOSSSL, and SIGMOS, using the librosa library to process audio waveforms.
☆31Feb 16, 2024Updated 2 years ago
Alternatives and similar repositories for MOS
Users that are interested in MOS are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆11Mar 23, 2020Updated 6 years ago
- ☆10Apr 20, 2022Updated 4 years ago
- Learning an Interpretable End-to-End Network for Real-Time Acoustic Beamforming☆19Aug 20, 2024Updated last year
- ☆19Mar 2, 2024Updated 2 years ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆12Mar 14, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- (WIP) A retrain of F5-TTS on permissively-licensed data☆14Apr 6, 2025Updated last year
- speex aec kalman filter☆15Mar 17, 2024Updated 2 years ago
- ☆47Aug 31, 2024Updated last year
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Oct 8, 2023Updated 2 years ago
- Implementation of BEST-RQ - a model for self-supervised learning of speech signals using a random projection quantizer, in Pytorch.☆134Sep 25, 2023Updated 2 years ago
- Unofficial instructions for changing Python kernel version on Google Colab.☆25Apr 21, 2025Updated last year
- ☆27Dec 11, 2025Updated 4 months ago
- ☆16Dec 18, 2023Updated 2 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- ☆19Jun 29, 2025Updated 10 months ago
- ☆14Jan 6, 2024Updated 2 years ago
- ☆16Nov 9, 2023Updated 2 years ago
- ☆21Jul 29, 2024Updated last year
- CK-NNTest: collaboratively validating, benchmarking and optimizing neural net operators across platforms, frameworks and datasets☆15Jul 10, 2021Updated 4 years ago
- Unofficial Pytorch Lightning Implementation of "Real-time Speech Frequency Bandwidth Extension"☆41Oct 20, 2025Updated 6 months ago
- Generation scripts for EARS-WHAM and EARS-Reverb☆44Jul 4, 2025Updated 10 months ago
- SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)☆108Aug 1, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for paper "Unsupervised Noise adaptation using Data Simulation"☆14May 16, 2024Updated last year
- PAM is a no-reference audio quality metric for audio generation tasks☆76Jul 19, 2024Updated last year
- Analysis of XLS-R for Speech Quality Assessment☆15Feb 10, 2025Updated last year
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- temporary files created by opensubtitles-scraper☆17Feb 3, 2026Updated 3 months ago
- This contains python scripts for converting wav files to pcm data for further processing.☆12May 26, 2017Updated 8 years ago
- ☆25Dec 12, 2017Updated 8 years ago
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆22Apr 1, 2022Updated 4 years ago
- Bach:The learning based model for audio super resolution☆10Jan 26, 2018Updated 8 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆44Jun 10, 2024Updated last year
- A Python library for computing the Mel-Cepstral Distance (Mel-Cepstral Distortion, MCD) between two inputs. This implementation is based …☆66Aug 24, 2025Updated 8 months ago
- ☆25Jan 24, 2023Updated 3 years ago
- Official code implementation of "MAD: A Military Audio Dataset for Situational Awareness and Surveillance"☆15Nov 26, 2025Updated 5 months ago
- Audio Super Resolution in Python3 with Tensorflow 1.5.0 (ref. https://kuleshov.github.io/audio-super-res/)☆12Jul 10, 2018Updated 7 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- This comprehensive guide provides a universal process for preparing your own speech datasets and training a custom Text-to-Speech (TTS) m…☆26May 3, 2025Updated last year