Kinyugo / torch_mdct
A PyTorch implementation of the Modified Discrete Cosine Transform (MDCT) and its inverse for audio processing.
☆23Updated 3 months ago
Alternatives and similar repositories for torch_mdct:
Users that are interested in torch_mdct are comparing it to the libraries listed below
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 2 years ago
- ☆10Updated 2 years ago
- Official implementation of Self-Remixing☆13Updated last year
- A small tool to calculate the distribution of audio durations in a directory☆14Updated 2 years ago
- ☆61Updated last year
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated 2 years ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆29Updated 3 months ago
- Spherical residual vector quantization (SRVQ)☆28Updated 7 months ago
- Audio samples for the paper "TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids"☆41Updated 4 years ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆13Updated 2 years ago
- The implementation of MDNet, which is in submission to Interspeech2022☆13Updated 2 years ago
- STOI loss functions in PyTorch (mirror of https://github.com/mpariente/pytorch_stoi)☆15Updated 4 years ago
- Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch☆28Updated 3 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Updated 3 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- Landing Page for Divide and Remaster v3☆17Updated 8 months ago
- Da - ECHO - RetrievAl - daTasEt☆26Updated 8 months ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆18Updated 2 years ago
- Unofficial PyTorch implementation of "SCNet: Sparse Compression Network for Music Source Separation"☆53Updated 11 months ago
- Paderbox: A collection of utilities for audio / speech processing☆38Updated last month
- Official implementation of EfficientLEAF, a learnable audio frontend.☆40Updated 2 years ago
- ☆26Updated last month
- A C++/Cython audio limiter for Python.☆25Updated 2 years ago
- ☆20Updated 5 months ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆18Updated 2 years ago
- A temporal module for PyTorch-ComplexTensor☆44Updated 9 months ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆29Updated last year
- ☆48Updated 2 years ago
- ☆11Updated 2 years ago
- Differentiable implementation of MSBG hearing loss model and MBSTOI intelligibility metric for Clarity Enhancement challenge.☆16Updated 3 years ago