A pip installable package for optimal transport inspired loss functions in the spectral domain. Can be used for audio applications such as differentiable digital signal processing or audio signal comparison.
☆29Dec 5, 2025Updated 3 months ago
Alternatives and similar repositories for spectral-optimal-transport
Users that are interested in spectral-optimal-transport are comparing it to the libraries listed below
Sorting:
- Acoustic impulse response generation using diffusion models☆76Oct 3, 2023Updated 2 years ago
- ☆28Sep 5, 2024Updated last year
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆36Jun 20, 2023Updated 2 years ago
- pytorch code for sound event localization and classification☆13Aug 12, 2021Updated 4 years ago
- Deep Probabalistic Models: Materials for my course for the Australian Mathematical Sciences Institute (AMSI) Winter School 2021☆11Feb 7, 2022Updated 4 years ago
- VocalVerse: A powerful vocal evaluation framework powered by the Qwen LLMs☆38Jan 22, 2026Updated last month
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- ☆20Sep 20, 2024Updated last year
- Official implementation of the paper How to Listen? Rethinking Visual Sound Localization☆18Apr 25, 2022Updated 3 years ago
- Pytorch implementation of the icosahedral CNNs☆20Apr 24, 2023Updated 2 years ago
- Companion code of DAFx23 "Differentiable Feedback Delay Network for Colorless Reverberation"☆51Apr 7, 2025Updated 11 months ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- Diffraction Enhanced Image Source Method (Python)☆30Updated this week
- DDSP experiments in Faust☆30Feb 12, 2025Updated last year
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".☆23Sep 27, 2025Updated 5 months ago
- Official repository of the work "Speaker Distance Estimation in Enclosures from Single-Channel Audio" published to IEEE/ACM Transactions …☆30Nov 18, 2025Updated 3 months ago
- BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models☆63Oct 18, 2024Updated last year
- collection of pitch (f0, fundamental frequency) detection algorithms with unified interface☆25Nov 25, 2024Updated last year
- Gaussian processes for sound field reconstruction☆22Nov 5, 2020Updated 5 years ago
- Official implementation of the ICASSP 2023 paper "HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields"☆26Dec 3, 2023Updated 2 years ago
- Official repository of Wavehax vocoder☆66Dec 20, 2025Updated 2 months ago
- Alignment examples for Interspeech 2024☆27Jul 5, 2024Updated last year
- Fast and differentiable time domain all-pole filter in PyTorch.☆68Feb 5, 2026Updated last month
- This is a Python implementation of the Auditory Toolbox☆71Oct 3, 2024Updated last year
- ☆37Sep 21, 2025Updated 5 months ago
- PINN for solving wave equation☆24Dec 17, 2022Updated 3 years ago
- ☆31Aug 22, 2025Updated 6 months ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23☆121Mar 14, 2023Updated 2 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆69Apr 27, 2023Updated 2 years ago
- Official repository for GraFPrint: an audio identification framework based on graph neural networks.☆38Sep 18, 2025Updated 5 months ago
- Differentiable audio signal processors in PyTorch☆285Dec 4, 2023Updated 2 years ago
- SpectroMap is a peak detection algorithm that computes the constellation map for a given signal☆32Jun 19, 2024Updated last year
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- Code for Look for the Change paper published at CVPR 2022☆36Oct 26, 2022Updated 3 years ago
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio☆74Mar 17, 2025Updated 11 months ago
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆34Nov 23, 2023Updated 2 years ago
- [ICASSP 2024] Official code for FreGrad☆35May 13, 2024Updated last year
- Code for sound field predictions in domains with impedance boundaries. Used for generating results from the paper "Physics-informed neura…☆36Jan 15, 2024Updated 2 years ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…☆37Jun 24, 2025Updated 8 months ago