nomonosound / log-wmse-audio-qualityLinks
logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even when there are many audio tracks or stems.
☆36Updated 4 months ago
Alternatives and similar repositories for log-wmse-audio-quality
Users that are interested in log-wmse-audio-quality are comparing it to the libraries listed below
Sorting:
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Updated 2 weeks ago
- Landing Page for Divide and Remaster v3☆22Updated 3 months ago
- Project for MIDI to Audio Synthesis☆25Updated 2 years ago
- Reproducible Subjective Evaluation☆61Updated last year
- PyTorch Dataset for Speech and Music audio☆78Updated last year
- Differentiable dynamic range controller in PyTorch.☆51Updated last month
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆70Updated 2 years ago
- PodcastMix A dataset for separating music and speech in podcasts.☆44Updated last year
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆44Updated 2 years ago
- Frechet Audio Distance evaluation in PyTorch☆35Updated 2 years ago
- This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆33Updated 3 years ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆44Updated this week
- Full models and training code for PESTO☆71Updated last year
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆35Updated 8 months ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆65Updated 2 years ago
- Fast and differentiable time domain all-pole filter in PyTorch.☆65Updated 2 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆39Updated 6 months ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆33Updated 2 years ago
- Implementation for "Music Enhancement via Image Translation and Vocoding"☆55Updated 3 years ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆120Updated 2 years ago
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆52Updated 5 months ago
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models☆63Updated 2 years ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆23Updated last year
- ☆71Updated last year
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- Prosody and Pronunciation Modification Network☆59Updated 6 months ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆44Updated 6 months ago
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind☆81Updated 3 months ago
- ☆30Updated 2 years ago
- ☆19Updated 4 years ago