nomonosound / log-wmse-audio-qualityLinks
logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even when there are many audio tracks or stems.
☆35Updated 2 months ago
Alternatives and similar repositories for log-wmse-audio-quality
Users that are interested in log-wmse-audio-quality are comparing it to the libraries listed below
Sorting:
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Updated last week
- Differentiable dynamic range controller in PyTorch.☆51Updated 9 months ago
- Project for MIDI to Audio Synthesis☆25Updated 2 years ago
- Reproducible Subjective Evaluation☆60Updated last year
- PodcastMix A dataset for separating music and speech in podcasts.☆44Updated last year
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆43Updated 3 months ago
- Frechet Audio Distance evaluation in PyTorch☆36Updated 2 years ago
- Full models and training code for PESTO☆69Updated last year
- ☆67Updated last year
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆32Updated 6 months ago
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models☆63Updated 2 years ago
- Fast and differentiable time domain all-pole filter in PyTorch.☆65Updated last month
- PyTorch Dataset for Speech and Music audio☆78Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆70Updated 2 years ago
- Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.☆89Updated last month
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆42Updated 2 years ago
- A Python Library for Fundamental Frequency Estimation in Music Recordings☆50Updated 4 months ago
- Landing Page for Divide and Remaster v3☆20Updated last month
- Landing Page for All Things Source Separation☆33Updated this week
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆52Updated 3 months ago
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".☆23Updated 7 months ago
- Implementation for "Music Enhancement via Image Translation and Vocoding"☆55Updated 3 years ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- logWMSE, an audio quality metric & loss function with support for digital silence target. Useful for training and evaluating audio source…☆41Updated 4 months ago
- Prosody and Pronunciation Modification Network☆56Updated 4 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆37Updated 3 months ago
- This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆33Updated 3 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆62Updated 2 years ago
- ☆25Updated last year