pranaymanocha/PerceptualAudio

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pranaymanocha/PerceptualAudio)

pranaymanocha / PerceptualAudio

Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM

☆382

Alternatives and similar repositories for PerceptualAudio

Users that are interested in PerceptualAudio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

adrienchaton / PerceptualAudio_Pytorch
View on GitHub
Pytorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - un…
☆65Apr 2, 2020Updated 6 years ago
csteinmetz1 / auraloss
View on GitHub
Collection of audio-focused loss functions in PyTorch
☆874Jul 30, 2024Updated last year
aliutkus / speechmetrics
View on GitHub
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
☆1,050Jul 5, 2023Updated 3 years ago
KinWaiCheuk / nnAudio
View on GitHub
Audio processing by using pytorch 1D convolution network
☆1,129May 21, 2026Updated 2 months ago
facebookresearch / WavAugment
View on GitHub
A library for speech data augmentation in time-domain
☆689Aug 30, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
maum-ai / phaseaug
View on GitHub
ICASSP 2023 Accepted
☆191May 6, 2024Updated 2 years ago
iver56 / torch-audiomentations
View on GitHub
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,161Nov 24, 2025Updated 8 months ago
google / visqol
View on GitHub
Perceptual Quality Estimator for speech and audio
☆911May 17, 2025Updated last year
adefossez / julius
View on GitHub
Fast PyTorch based DSP for audio and 1D signals
☆460Jun 3, 2026Updated last month
FrancoisGrondin / BIRD
View on GitHub
Big Impulse Response Dataset
☆159Oct 19, 2022Updated 3 years ago
jmcasebeer / autodsp
View on GitHub
Train custom adaptive filter optimizers without hand tuning or extra labels.
☆67Oct 14, 2021Updated 4 years ago
lochenchou / MOSNet
View on GitHub
Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"
☆380Jul 21, 2024Updated 2 years ago
rishikksh20 / HiFiplusplus-pytorch
View on GitHub
HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement
☆160Jul 16, 2022Updated 4 years ago
gabrielmittag / NISQA
View on GitHub
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆963Dec 1, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
iver56 / audiomentations
View on GitHub
A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
☆2,302Apr 13, 2026Updated 3 months ago
maxrmorrison / torchcrepe
View on GitHub
Pytorch implementation of the CREPE pitch tracker
☆523May 16, 2025Updated last year
ivanvovk / WaveGrad
View on GitHub
Implementation of WaveGrad high-fidelity vocoder from Google Brain in PyTorch.
☆409Jul 7, 2021Updated 5 years ago
brentspell / torch-yin
View on GitHub
Yin pitch estimator in PyTorch
☆119Nov 7, 2022Updated 3 years ago
sigsep / norbert
View on GitHub
Painless Wiener filters for audio separation
☆191May 4, 2026Updated 2 months ago
descriptinc / cargan
View on GitHub
Official repository for the paper "Chunked Autoregressive GAN for Conditional Waveform Synthesis"
☆193Dec 8, 2022Updated 3 years ago
schmiph2 / pysepm
View on GitHub
Python implementation of performance metrics in Loizou's Speech Enhancement book
☆456Feb 15, 2025Updated last year
DavidDiazGuerra / gpuRIR
View on GitHub
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
☆607Jul 18, 2025Updated last year
lmnt-com / wavegrad
View on GitHub
A fast, high-quality neural vocoder.
☆299Jul 18, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ifnspaml / Perceptual-Weighting-Filter-Loss
View on GitHub
A perceptual weighting filter loss for DNN training in speech enhancement
☆24Apr 30, 2022Updated 4 years ago
TEAMuP-dev / audacitorch
View on GitHub
PyTorch wrappers for using your model in audacity!
☆181Aug 13, 2023Updated 2 years ago
ViEm-ccy / GEDLoss_pytorch
View on GitHub
a pytorch implementation of Google GEDLoss
☆32Dec 9, 2020Updated 5 years ago
gudgud96 / frechet-audio-distance
View on GitHub
A lightweight library for Frechet Audio Distance calculation.
☆317Feb 11, 2026Updated 5 months ago
descriptinc / descript-audio-codec
View on GitHub
State-of-the-art audio codec with 90x compression factor. Supports 44.1kHz, 24kHz, and 16kHz mono/stereo audio.
☆1,841Jul 16, 2026Updated last week
jzmzhong / Automatic-Prosody-Annotator-with-SSWP-CLAP
View on GitHub
An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS).
☆50Jun 11, 2024Updated 2 years ago
qiuqiangkong / torchlibrosa
View on GitHub
☆512Jun 25, 2024Updated 2 years ago
asteroid-team / asteroid
View on GitHub
The PyTorch-based audio source separation toolkit for researchers
☆2,578May 13, 2026Updated 2 months ago
ludlows / PESQ
View on GitHub
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
☆631Mar 18, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
zceng / LVCNet
View on GitHub
LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation
☆80Feb 24, 2021Updated 5 years ago
etzinis / sudo_rm_rf
View on GitHub
Code for SuDoRm-Rf networks for efficient audio source separation. SuDoRm-Rf stands for SUccessive DOwnsampling and Resampling of Multi-R…
☆336Jul 6, 2023Updated 3 years ago
audiolabs / torch-pesq
View on GitHub
PyTorch implementation of the Perceptual Evaluation of Speech Quality for wideband audio
☆228Jul 14, 2023Updated 3 years ago
Aworselife / DPTBF
View on GitHub
☆17Sep 12, 2023Updated 2 years ago
tts-tutorial / interspeech2022
View on GitHub
☆162Sep 19, 2022Updated 3 years ago
acids-ircam / ddsp_pytorch
View on GitHub
Implementation of Differentiable Digital Signal Processing (DDSP) in Pytorch
☆518Oct 28, 2023Updated 2 years ago
kan-bayashi / ParallelWaveGAN
View on GitHub
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
☆1,646Apr 22, 2024Updated 2 years ago