bernardo-torres / spectral-optimal-transport
A pip-installable package of optimal-transport-inspired loss functions in the spectral domain. It can be used for audio applications such as differentiable digital signal processing or audio signal comparison.
☆29 Dec 5, 2025 Updated 2 months ago
Alternatives and similar repositories for spectral-optimal-transport
Users who are interested in spectral-optimal-transport are comparing it to the libraries listed below.
- ☆27 Sep 5, 2024 Updated last year
- Acoustic impulse response generation using diffusion models ☆76 Oct 3, 2023 Updated 2 years ago
- PyTorch implementation of the invertible CQT based on Non-stationary Gabor filters ☆36 Jun 20, 2023 Updated 2 years ago
- PyTorch code for sound event localization and classification ☆13 Aug 12, 2021 Updated 4 years ago
- Deep Probabilistic Models: Materials for my course for the Australian Mathematical Sciences Institute (AMSI) Winter School 2021 ☆11 Feb 7, 2022 Updated 4 years ago
- VocalVerse: A powerful vocal evaluation framework powered by the Qwen LLMs ☆37 Jan 22, 2026 Updated 3 weeks ago
- ☆19 Sep 20, 2024 Updated last year
- Unconditional music synthesis using a diffusion model in the STFT domain ☆12 May 31, 2022 Updated 3 years ago
- PyTorch implementation of icosahedral CNNs ☆20 Apr 24, 2023 Updated 2 years ago
- Companion code of DAFx23 "Differentiable Feedback Delay Network for Colorless Reverberation" ☆51 Apr 7, 2025 Updated 10 months ago
- FreeCodec: A Disentangled Neural Speech Codec with Fewer Tokens ☆24 Sep 9, 2024 Updated last year
- Diffraction Enhanced Image Source Method (Python) ☆30 Updated this week
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation". ☆23 Sep 27, 2025 Updated 4 months ago
- Official repository of the work "Speaker Distance Estimation in Enclosures from Single-Channel Audio" published to IEEE/ACM Transactions … ☆28 Nov 18, 2025 Updated 2 months ago
- BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models ☆62 Oct 18, 2024 Updated last year
- Official implementation of the ICASSP 2023 paper "HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields" ☆25 Dec 3, 2023 Updated 2 years ago
- Gaussian processes for sound field reconstruction ☆22 Nov 5, 2020 Updated 5 years ago
- Collection of pitch (f0, fundamental frequency) detection algorithms with a unified interface ☆24 Nov 25, 2024 Updated last year
- Official repository of Wavehax vocoder ☆66 Dec 20, 2025 Updated last month
- Alignment examples for Interspeech 2024 ☆27 Jul 5, 2024 Updated last year
- Fast and differentiable time domain all-pole filter in PyTorch. ☆68 Feb 5, 2026 Updated last week
- This is a Python implementation of the Auditory Toolbox ☆71 Oct 3, 2024 Updated last year
- PINN for solving wave equation ☆24 Dec 17, 2022 Updated 3 years ago
- ☆37 Sep 21, 2025 Updated 4 months ago
- ☆31 Aug 22, 2025 Updated 5 months ago
- Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23 ☆121 Mar 14, 2023 Updated 2 years ago
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation… ☆69 Apr 27, 2023 Updated 2 years ago
- Differentiable audio signal processors in PyTorch ☆283 Dec 4, 2023 Updated 2 years ago
- Official repository for GraFPrint: an audio identification framework based on graph neural networks. ☆36 Sep 18, 2025 Updated 4 months ago
- SpectroMap is a peak detection algorithm that computes the constellation map for a given signal ☆32 Jun 19, 2024 Updated last year
- Reproducing code for Learning Disentangled Representations of Timbre and Pitch for Musical Instrument Sounds Using Gaussian Mixture Varia… ☆29 May 20, 2020 Updated 5 years ago
- [ICASSP 2024] Official code for FreGrad ☆35 May 13, 2024 Updated last year
- A robust pitch tracker using synchro-squeezed FFT and frequency-domain autocorrelation ☆36 Jan 17, 2024 Updated 2 years ago
- Sylber: Syllabic Embedding Representation of Speech from Raw Audio ☆72 Mar 17, 2025 Updated 11 months ago
- Code for sound field predictions in domains with impedance boundaries. Used for generating results from the paper "Physics-informed neura… ☆35 Jan 15, 2024 Updated 2 years ago
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody" ☆34 Nov 23, 2023 Updated 2 years ago
- logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe… ☆37 Jun 24, 2025 Updated 7 months ago
- An overview of GRPO & DeepSeek-R1 Training with Open Source GRPO Model Fine Tuning ☆36 May 18, 2025 Updated 8 months ago
- Research paper repository for "A Hand Structure-Based Mobile Authentication Solution to the Security-Reliability Trade-off" Paper from NJ… ☆13 Jul 30, 2023 Updated 2 years ago