johnmartinsson/differentiable-mel-spectrogram

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/johnmartinsson/differentiable-mel-spectrogram)

johnmartinsson / differentiable-mel-spectrogram

The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer in neural networks".

☆24

Alternatives and similar repositories for differentiable-mel-spectrogram

Users that are interested in differentiable-mel-spectrogram are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

WildHoneyPie / BEAST
View on GitHub
Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste…
☆44Sep 11, 2024Updated last year
cwitkowitz / ss-mpe
View on GitHub
Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".
☆25Sep 27, 2025Updated 9 months ago
stefan-balke / mpa-exc
View on GitHub
Some Demo Code for the MPA Exercise.
☆10Dec 4, 2017Updated 8 years ago
gudgud96 / piano-synthesis
View on GitHub
Code accompanying ML4MD ICML 2020 paper - "Generative Modelling for Controllable Audio Synthesis of Expressive Piano Performance".
☆31Jul 22, 2020Updated 5 years ago
KinWaiCheuk / Jointist
View on GitHub
Official Implementation of Jointist
☆37Jul 26, 2023Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
bbc / dsrp_bbcavs10k_distribution
View on GitHub
Repo for the BBCAVS10k distribution
☆10Nov 27, 2024Updated last year
LiChaiUSTC / CSL-L2M
View on GitHub
☆18May 4, 2025Updated last year
Yip-Jia-Qi / codecformer
View on GitHub
☆21Jul 15, 2024Updated 2 years ago
genisplaja / diffusion-vocal-sep
View on GitHub
Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)
☆17Feb 16, 2023Updated 3 years ago
mushanshanshan / ESLTTS
View on GitHub
ESLTTS dataset
☆16Feb 6, 2025Updated last year
SubramaniKrishna / point-cloud-audio
View on GitHub
Accompanying code for our paper "Point Cloud Audio Processing"
☆18Jul 1, 2021Updated 5 years ago
IoBT-VISTEC / MUSEC
View on GitHub
For accessing to the dataset, please send your short bio and objective of the study to Dr.Theerawit Wilaiprasitporn (theerawit dot w at v…
☆14Apr 29, 2021Updated 5 years ago
SonyResearch / VRVQ
View on GitHub
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54May 1, 2025Updated last year
ismir-24-sub / unsupervised_compositional_representations
View on GitHub
ISMIR 24 Supplementary Material
☆14Oct 28, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
Audio-WestlakeU / audiossl
View on GitHub
A library built for easier audio self-supervised training, downstream tasks evaluation
☆140Sep 25, 2025Updated 9 months ago
shtdbb / MusicTextAlignment
View on GitHub
This is a dataset that aligns piano music MIDI with their corresponding textual descriptions and comments. It can be used for multi-modal…
☆12Nov 21, 2023Updated 2 years ago
rickiepark / cnn_mer
View on GitHub
☆10Nov 6, 2017Updated 8 years ago
mdx-workshop / mdx-submissions21
View on GitHub
Music Demixing Challenge Submission Repo
☆16Sep 8, 2023Updated 2 years ago
SonyResearch / diffvox
View on GitHub
Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"
☆40Oct 28, 2025Updated 8 months ago
CGCL-codes / Gen-AF
View on GitHub
The implementation of our IEEE S&P 2024 paper "Securely Fine-tuning Pre-trained Encoders Against Adversarial Examples".
☆11Jun 28, 2024Updated 2 years ago
mbsantiago / whombat
View on GitHub
Audio Annotation Tool for ML development
☆91Jul 8, 2026Updated last week
christhetree / mod_discovery
View on GitHub
Source code for "Modulation Discovery with Differentiable Digital Signal Processing".
☆15Mar 25, 2026Updated 3 months ago
Hayeonbang / PIAST
View on GitHub
A piano music dataset with Audio, Symbolic and Text labels
☆36Mar 6, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
bshall / dusted
View on GitHub
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17Oct 2, 2024Updated last year
aim-qmul / sdx23-aimless
View on GitHub
Source Separation training codebase for the Sound Demixing Challenge 2023.
☆45May 18, 2023Updated 3 years ago
christhetree / scrapl
View on GitHub
Scattering Transform with Random Paths for Machine Learning
☆16Apr 9, 2026Updated 3 months ago
ilya16 / ScorePerformer
View on GitHub
ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023)
☆42Mar 10, 2025Updated last year
xiaoxue1117 / speech-mamba-public
View on GitHub
☆15Nov 26, 2024Updated last year
DeepSpectrum / DeepSpectrumLite
View on GitHub
Light-weight transfer learning framework for on-device speech and audio recognition using pre-trained image convolutional neural networks…
☆18Apr 16, 2022Updated 4 years ago
SubramaniKrishna / STFTgrad
View on GitHub
Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"
☆33Oct 30, 2020Updated 5 years ago
zelaki / DisfluentFA
View on GitHub
A Weakly Supervised Forced Alignment for disluent speech
☆15Nov 12, 2023Updated 2 years ago
NAVER-INTEL-Co-Lab / gaudi-lavcap
View on GitHub
☆15Jan 24, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jordipons / CBMI2016
View on GitHub
Experimenting with musically motivated convolutional neural networks
☆16Jun 8, 2016Updated 10 years ago
maxrmorrison / torbi
View on GitHub
Viterbi decoding in PyTorch
☆42May 5, 2026Updated 2 months ago
aeromamba-super-resolution / aeromamba
View on GitHub
Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…
☆50Nov 11, 2025Updated 8 months ago
YatingMusic / MusiConGen
View on GitHub
☆88Oct 20, 2024Updated last year
Netflix-Skunkworks / listening-test-app
View on GitHub
☆21May 23, 2024Updated 2 years ago
cyrusasfa / meso-dtfa
View on GitHub
Mesostructures: Beyond Spectrogram Loss in Differentiable Time-Frequency Analysis (Meso-DTFA)
☆21Jun 30, 2026Updated 2 weeks ago
YoonjinXD / kadtk
View on GitHub
A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …
☆104Jun 12, 2025Updated last year