Kinyugo / torch_mdct
A PyTorch implementation of the Modified Discrete Cosine Transform (MDCT) and its inverse for audio processing.
☆23Updated 4 months ago
Alternatives and similar repositories for torch_mdct:
Users that are interested in torch_mdct are comparing it to the libraries listed below
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"☆33Updated 3 years ago
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- The implementation of MDNet, which is in submission to Interspeech2022☆13Updated 2 years ago
- Official implementation of DualCycleGAN for nonparallel audio super resolution☆53Updated 2 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆40Updated 2 years ago
- ☆20Updated 6 months ago
- ☆83Updated last year
- Audio samples for the paper "TinyLSTMs: Efficient Neural Speech Enhancement for Hearing Aids"☆42Updated 4 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆21Updated 2 years ago
- Generalized Minimal Distortion Principle for Blind Source Separation☆20Updated 4 years ago
- Repository of published DNN speech separation recipes for a number of datasets☆12Updated last year
- ☆61Updated last year
- A small tool to calculate the distribution of audio durations in a directory☆14Updated 2 years ago
- Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" by pytorch☆28Updated 3 years ago
- Da - ECHO - RetrievAl - daTasEt☆26Updated 9 months ago
- Official implementation of Self-Remixing☆13Updated last year
- ☆16Updated last year
- Spherical residual vector quantization (SRVQ)☆28Updated 7 months ago
- ☆10Updated 2 years ago
- real-time speech enhance☆14Updated last year
- Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction☆11Updated 9 months ago
- Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings☆15Updated 2 months ago
- Unofficial PyTorch implementation of "SCNet: Sparse Compression Network for Music Source Separation"☆53Updated last year
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆19Updated 2 years ago
- Zero-Shot Blind Audio Bandwidth Extension☆21Updated last year
- ☆25Updated last year
- ☆28Updated 11 months ago
- Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"☆13Updated 2 years ago
- Official Implementation of "Inference and Denoise: Causal Inference-based Neural Speech Enhancement"☆27Updated 2 years ago