soham97/PAM

PAM is a no-reference audio quality metric for audio generation tasks

☆77

Alternatives and similar repositories for PAM

Users interested in PAM are comparing it to the libraries listed below.

microsoft / NoAudioCaptioning
Repository for "Training Audio Captioning Models without Audio"
☆10 · Sep 26, 2023 · Updated 2 years ago
facebookresearch / audiobox-aesthetics
Unified automatic quality assessment for speech, music, and sound.
☆744 · Jun 5, 2025 · Updated last year
nii-yamagishilab / mos-finetune-ssl
☆112 · Jun 14, 2023 · Updated 3 years ago
JozefColdenhoff / OpenACE
☆11 · Aug 1, 2025 · Updated 11 months ago
Takaaki-Saeki / DiscreteSpeechMetrics
Reference-aware automatic speech evaluation toolkit
☆185 · Dec 5, 2024 · Updated last year
unilight / sheet
Speech Human Evaluation Estimation Toolkit (SHEET)
☆137 · Mar 31, 2026 · Updated 3 months ago
yuwchen / InQSS
☆15 · Oct 6, 2023 · Updated 2 years ago
soham97 / ADIFF
Explaining audio differences using language
☆16 · Feb 11, 2025 · Updated last year
wavlab-speech / versa
Versatile Evaluation of Speech and Audio
☆423 · Updated this week
alessandroragano / scoreq
SCOREQ: Speech COntrastive REgression for Quality Assessment (NeurIPS 2024)
☆114 · Aug 1, 2025 · Updated 11 months ago
microsoft / fadtk
A simple library for Fréchet Audio Distance (FAD) calculation
☆266 · Aug 22, 2025 · Updated 10 months ago
microsoft / AudioEntailment
Audio Entailment: Deductive Reasoning for Audio Understanding
☆17 · Dec 10, 2024 · Updated last year
NKU-HLT / RAMP_MOS
[IEEE TASLP] Retrieval-Augmented MOS Prediction with Prior Knowledge Integration
☆33 · Mar 23, 2025 · Updated last year
fcumlin / DNSMOSPro
Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).
☆98 · Jun 8, 2025 · Updated last year
sarulab-speech / UTMOS22
UT-Sarulab MOS prediction system using SSL models
☆309 · Apr 11, 2024 · Updated 2 years ago
Fraunhofer-IIS / ODAQ
A collection of audio signals accompanied by corresponding subjective scores of perceived quality. Everything under permissive licenses.
☆53 · Feb 24, 2026 · Updated 4 months ago
ilpoviertola / V-AURA
The official implementation of V-AURA: Temporally Aligned Audio for Video with Autoregression (ICASSP 2025) (Oral)
☆35 · Feb 11, 2026 · Updated 5 months ago
lcn-kul / xls-r-analysis-sqa
Analysis of XLS-R for Speech Quality Assessment
☆15 · Feb 10, 2025 · Updated last year
dhimasryan / TMHINT-QI-VoiceMOS2023
☆17 · Oct 18, 2023 · Updated 2 years ago
dyahayumgw / HAAQI-Net
HAAQI-Net is a novel DNN-based non-intrusive method for assessing music audio quality in hearing aid users.
☆18 · Sep 26, 2025 · Updated 9 months ago
ttsds / ttsds
The TTSDS benchmark evaluates synthetic speech quality by considering prosody, speaker identity, and intelligibility, comparing these fac…
☆96 · Jul 7, 2026 · Updated 2 weeks ago
soham97 / mellow
small audio language model for reasoning
☆88 · Dec 4, 2025 · Updated 7 months ago
alessandroragano / nomad
NOMAD: Non-Matching Audio Distance (ICASSP 2024)
☆30 · Jun 17, 2025 · Updated last year
NKU-HLT / SpeechLLM-as-Judges
[ACL 2026]
☆24 · Dec 6, 2025 · Updated 7 months ago
NKU-HLT / MusicEval-baseline
☆12 · Apr 18, 2025 · Updated last year
OFA-Sys / AIR-Bench
AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension
☆133 · Dec 9, 2024 · Updated last year
Netflix-Skunkworks / listening-test-app
☆21 · May 23, 2024 · Updated 2 years ago
soumimaiti / speechlmscore_tool
☆34 · Nov 24, 2024 · Updated last year
unilight / LDNet
Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"
☆68 · Dec 13, 2021 · Updated 4 years ago
sarulab-speech / UTMOSv2
UTokyo-SaruLab MOS Prediction System
☆350 · Apr 2, 2026 · Updated 3 months ago
JasonSWFu / VQscore
☆59 · Dec 2, 2024 · Updated last year
chomeyama / HN-UnifiedSourceFilterGAN
☆88 · Nov 1, 2022 · Updated 3 years ago
tarepan / SpeechMOS
Easy-to-Use Speech MOS predictors
☆360 · Oct 24, 2023 · Updated 2 years ago
sony / bigvsan_eval
Evaluation tool used in the BigVSAN paper
☆14 · Mar 22, 2024 · Updated 2 years ago
gudgud96 / frechet-audio-distance
A lightweight library for Frechet Audio Distance calculation.
☆317 · Feb 11, 2026 · Updated 5 months ago
salu133445 / deepperformer
Deep Performer: Score-to-audio music performance synthesis
☆47 · Jun 26, 2023 · Updated 3 years ago
facebookresearch / ears_dataset
Expressive Anechoic Recordings of Speech (EARS)
☆221 · Jun 25, 2024 · Updated 2 years ago
walker-hyf / GPT-Talker
Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)
☆78 · Nov 1, 2024 · Updated last year
kuielab / voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (50+ datasets).
☆20 · Apr 1, 2021 · Updated 5 years ago