TomohikoNakamura / dwtls
Discrete wavelet transform layers with fixed and trainable wavelets
☆21Updated last year
Related projects:
- A PyTorch implementation of "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 2 years ago
- Audio Masking Methods☆10Updated 4 years ago
- Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"☆31Updated 3 years ago
- Repository of published DNN speech separation recipes for a number of datasets☆10Updated 7 months ago
- PyTorch implementation of the invertible CQT based on Non-stationary Gabor filters☆28Updated last year
- Learning Complex Basis Functions for Invariant Signal Representations with the Complex Autoencoder☆33Updated 6 months ago
- Official PyTorch implementation for "Towards Lightweight Controllable Audio Synthesis with Conditional Implicit Neural Representations".☆20Updated 2 years ago
- A small tool to calculate the distribution of audio durations in a directory☆13Updated last year
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION", accepted at EUSIPCO 2022☆15Updated 2 years ago
- STOI loss functions in PyTorch (mirror of https://github.com/mpariente/pytorch_stoi)☆12Updated 4 years ago
- Multi-Resolution Neural Networks☆10Updated this week
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆21Updated 5 months ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆39Updated last year
- SRTNet☆24Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆25Updated this week
- Code for "DDD: A Perceptually Superior Low-Response-Time DNN-Based Declipper"☆17Updated 5 months ago
- Official implementation of the ICASSP 2023 paper "HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields"☆21Updated 9 months ago
- Simple baseline model for the HEAR benchmark☆22Updated last month
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆20Updated 6 months ago
- The official implementation of DMEL, the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆17Updated 4 months ago
- A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…☆20Updated 3 years ago
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)☆16Updated last year
- Addressing the confounds of accompaniments in singer identification☆18Updated 4 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Updated 2 years ago