The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer in neural networks".
☆22Dec 21, 2024Updated last year
Alternatives and similar repositories for differentiable-mel-spectrogram
Users that are interested in differentiable-mel-spectrogram are comparing it to the libraries listed below
Sorting:
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".☆23Sep 27, 2025Updated 5 months ago
- ☆21Jul 15, 2024Updated last year
- ☆18May 4, 2025Updated 9 months ago
- Official Implementation of Jointist☆37Jul 26, 2023Updated 2 years ago
- applying audio FX with text descriptors☆33Nov 12, 2025Updated 3 months ago
- Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste…☆41Sep 11, 2024Updated last year
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆50Nov 11, 2025Updated 3 months ago
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆38Oct 28, 2025Updated 4 months ago
- ☆28Sep 5, 2024Updated last year
- ☆10Dec 22, 2023Updated 2 years ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 10 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- Repo for the BBCAVS10k distribution☆10Nov 27, 2024Updated last year
- [INTERSPEECH 2024] Official code for VoxSim: A perceptual voice similarity dataset☆12Sep 29, 2025Updated 5 months ago
- Some Demo Code for the MPA Exercise.☆10Dec 4, 2017Updated 8 years ago
- A Python Library for Fundamental Frequency Estimation in Music Recordings☆55Jan 16, 2026Updated last month
- Streaming Vocos☆30Jun 10, 2025Updated 8 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆51May 1, 2025Updated 10 months ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆95Jun 12, 2025Updated 8 months ago
- 基于PC-DDSP和nsf-HiFiGAN的声码器☆18Jul 17, 2023Updated 2 years ago
- ☆10Nov 6, 2017Updated 8 years ago
- Examples of how to use API of MVSep service☆29Jun 21, 2025Updated 8 months ago
- For accessing to the dataset, please send your short bio and objective of the study to Dr.Theerawit Wilaiprasitporn (theerawit dot w at v…☆14Apr 29, 2021Updated 4 years ago
- ☆15Nov 11, 2024Updated last year
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Mar 14, 2025Updated 11 months ago
- ☆14Nov 26, 2024Updated last year
- Official repository for GraFPrint: an audio identification framework based on graph neural networks.☆37Sep 18, 2025Updated 5 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Apr 10, 2023Updated 2 years ago
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆38Feb 24, 2025Updated last year
- Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"☆32Mar 4, 2025Updated 11 months ago
- A piano music dataset with Audio, Symbolic and Text labels☆34Mar 6, 2025Updated 11 months ago
- Music Demixing Challenge Submission Repo☆15Sep 8, 2023Updated 2 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- This is a dataset that aligns piano music MIDI with their corresponding textual descriptions and comments. It can be used for multi-modal…☆12Nov 21, 2023Updated 2 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- Prosody and Pronunciation Modification Network☆63May 5, 2025Updated 9 months ago
- Digital Speech Processing in PyTorch.☆15Aug 12, 2022Updated 3 years ago
- Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".☆22Jun 10, 2024Updated last year
- ☆85Oct 20, 2024Updated last year