Pytorch implementation of "A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences", Pranay Manocha et al. - unofficial work in progress
☆64Apr 2, 2020Updated 5 years ago
Alternatives and similar repositories for PerceptualAudio_Pytorch
Users that are interested in PerceptualAudio_Pytorch are comparing it to the libraries listed below
Sorting:
- Perceptual Metrics of Audio - perceptually relevant loss function. DPAM and CDPAM☆377Mar 24, 2023Updated 2 years ago
- Sound Morphing Toolbox (SMT)☆31Jul 8, 2022Updated 3 years ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆37Jun 24, 2025Updated 8 months ago
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆82Feb 9, 2021Updated 5 years ago
- Collection of audio-focused loss functions in PyTorch☆851Jul 30, 2024Updated last year
- AQP is a modular pipeline built to enable the comparison and testing of different quality metric configurations.☆33Jun 13, 2022Updated 3 years ago
- Scripts for recreating the Replication Dataset for Fundamental Frequency Estimation. Part of the dissertation "Pitch of Voiced Speech in …☆11Mar 29, 2021Updated 4 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Jul 25, 2024Updated last year
- Neural network model of the analog LA-2A dynamic range compressor☆23May 2, 2022Updated 3 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"☆34Oct 30, 2020Updated 5 years ago
- A python implementation of a traditional Dynamic Range Compressor☆14Oct 30, 2020Updated 5 years ago
- Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation☆14Nov 16, 2020Updated 5 years ago
- companion repository to the DAFx-19 paper "Assisted Sound Sample Generation with Musical Conditioning in Adversarial Auto-Encoders" by Ad…☆11Jun 22, 2019Updated 6 years ago
- Echo aware source separation☆13May 29, 2018Updated 7 years ago
- A convolutional generative audio synthesis model☆32Jun 17, 2022Updated 3 years ago
- Fast PyTorch based DSP for audio and 1D signals☆452Feb 17, 2025Updated last year
- multi-channel target speech extraction with channel decorrelation and target speaker adaptation☆27Feb 19, 2021Updated 5 years ago
- Zounds is a dataflow library for building directed acyclic graphs that transform audio. It uses the featureflow library to define the pro…☆24Dec 8, 2022Updated 3 years ago
- NeuroDrum - A neural network based percussion synthesiser☆33Apr 14, 2020Updated 5 years ago
- A PyTorch implementation of the musicnn model for music audio tagging☆38Jul 25, 2024Updated last year
- J-Net is aimed for audio separation with randomly weighted encoder.☆12Oct 23, 2019Updated 6 years ago
- Music Demixing Challenge Submission Repo☆15Sep 8, 2023Updated 2 years ago
- Official PyTorch implementation of "EdVAE: Mitigating Codebook Collapse with Evidential Discrete Variational Autoencoders"☆14Sep 20, 2024Updated last year
- A library for speech data augmentation in time-domain☆683Aug 30, 2021Updated 4 years ago
- ☆53May 15, 2025Updated 9 months ago
- Implementation of "MOSNet: Deep Learning based Objective Assessment for Voice Conversion"☆378Jul 21, 2024Updated last year
- Pytorch implementation of the CREPE pitch tracker☆508May 16, 2025Updated 9 months ago
- Experimenting with Lapped Transforms Jupyter Notebook☆14Jun 13, 2025Updated 8 months ago
- Permutation invariant training in PyTorch☆13Oct 2, 2020Updated 5 years ago
- Official pytorch implementation of the paper: "Catch-A-Waveform: Learning to Generate Audio from a Single Short Example" (NeurIPS 2021)☆191Apr 2, 2024Updated last year
- Non Stationary Gabor Transform (NSGT), Python implementation☆105Nov 10, 2023Updated 2 years ago
- Pitch-shifting and time-stretching with TD-PSOLA☆88Aug 16, 2023Updated 2 years ago
- Reproducible Subjective Evaluation☆61Mar 3, 2024Updated last year
- We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…☆43Mar 3, 2025Updated 11 months ago
- Hierarchical fast and high-fidelity audio generation☆75Jul 25, 2024Updated last year
- Official implementation of SawSing (ISMIR'22)☆272Aug 28, 2022Updated 3 years ago
- ☆88Nov 1, 2022Updated 3 years ago