Kinyugo / torch_mdct
A PyTorch implementation of the Modified Discrete Cosine Transform (MDCT) and its inverse for audio processing.
☆23Updated 5 months ago
Alternatives and similar repositories for torch_mdct
Users that are interested in torch_mdct are comparing it to the libraries listed below
Sorting:
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- Official implementation of Self-Remixing☆13Updated last year
- A small tool to calculate the distribution of audio durations in a directory☆14Updated 2 years ago
- The implementation of MDNet, which is in submission to Interspeech2022☆13Updated 3 years ago
- ☆83Updated last year
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆13Updated 3 years ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆30Updated last year
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025☆28Updated 2 months ago
- ☆10Updated 2 years ago
- Differentiable implementation of MSBG hearing loss model and MBSTOI intelligibility metric for Clarity Enhancement challenge.☆16Updated 3 years ago
- ☆12Updated last month
- Zero-Shot Blind Audio Bandwidth Extension☆21Updated last year
- Da - ECHO - RetrievAl - daTasEt☆26Updated 10 months ago
- ☆21Updated last year
- ☆17Updated 10 months ago
- Official implementation of the ICASSP 2023 paper "HRTF Field: Unifying Measured HRTF Magnitude Representation with Neural Fields"☆22Updated last year
- ☆13Updated last month
- Implementation of Sheffield entry for Clarity enhancement challenge.☆17Updated 3 years ago
- ☆25Updated last year
- Spherical residual vector quantization (SRVQ)☆28Updated 8 months ago
- STOI loss functions in PyTorch (mirror of https://github.com/mpariente/pytorch_stoi)☆15Updated 4 years ago
- Repository of published DNN speech separation recipes for a number of datasets☆12Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆33Updated this week
- ☆16Updated last year
- Toolbox for Evaluation of AEC/AES Systems☆19Updated this week
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆18Updated 2 years ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆15Updated 3 months ago
- This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆31Updated 2 years ago
- Landing Page for Divide and Remaster v3☆17Updated 10 months ago