☆30Nov 13, 2021Updated 4 years ago
Alternatives and similar repositories for Discriminator-Constrained-Optimal-Transport-Network
Users who are interested in Discriminator-Constrained-Optimal-Transport-Network are comparing it to the repositories listed below
- Transformer-based neural network for speech enhancement in the time domain☆79Mar 3, 2022Updated 4 years ago
- WaveCRN: An Efficient Convolutional Recurrent Neural Network for End-to-end Speech Enhancement☆42Sep 26, 2020Updated 5 years ago
- An unofficial code reproduction of Channel Attention Dense U-Net for Multichannel Speech Enhancement☆13Jul 17, 2023Updated 2 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Jun 23, 2022Updated 3 years ago
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆15Apr 7, 2025Updated 11 months ago
- Jupyter Notebook to accompany "Fixed-point DSP for Data Scientists" blog post 🧑‍💻🧑‍🔬☆16Mar 29, 2023Updated 2 years ago
- Official baseline for ICASSP 2026 URGENT Challenge Track 2 (Speech Quality Assessment)☆28Jan 8, 2026Updated 2 months ago
- Mini-batch multiplicative updates for beta-NMF☆15Jun 27, 2017Updated 8 years ago
- Repo for our pooling approach on the DCASE2018 task4☆15Jul 6, 2023Updated 2 years ago
- ☆18Jan 18, 2024Updated 2 years ago
- The official repo: "McNet: Fuse Multiple Cues for Multichannel Speech Enhancement", ICASSP 2023☆130Mar 24, 2023Updated 2 years ago
- A temporal module for PyTorch-ComplexTensor☆44Jun 28, 2024Updated last year
- ☆42Oct 30, 2019Updated 6 years ago
- Efficient Personalized Speech Enhancement through Self-Supervised Learning☆23Mar 12, 2023Updated 2 years ago
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"☆43Oct 30, 2025Updated 4 months ago
- ☆23Jun 30, 2023Updated 2 years ago
- CP-JKU submission to DCASE 20☆45Apr 19, 2021Updated 4 years ago
- Quality-Net: An End-to-End Non-intrusive Speech Quality Assessment Model based on BLSTM. (Interspeech, 2018, with Travel Grants)☆92Jul 22, 2019Updated 6 years ago
- Jupyter notebook for DCASE 2020 challenge Task 1☆20Jun 24, 2020Updated 5 years ago
- StoRM: A Diffusion-based Stochastic Regeneration Model for Speech Enhancement and Dereverberation☆254Sep 13, 2024Updated last year
- ☆70Feb 9, 2024Updated 2 years ago
- DNN-based SE in the frequency domain using Pytorch. You can test some state-of-the-art networks using T-F masking or spectral mapping met…☆58Apr 2, 2022Updated 3 years ago
- Noise Adaptive Speech Enhancement using Domain Adversarial Training☆23Jul 25, 2019Updated 6 years ago
- Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).☆28Apr 3, 2024Updated last year
- Acoustic Scene Classification Using Deep Residual Networks with Late Fusion of Separated High and Low Frequency Paths - McDonnell and Gao…☆22Jul 3, 2024Updated last year
- Speech analysis and synthesis using linear predictive coding (LPC) in Matlab☆27Nov 22, 2016Updated 9 years ago
- PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."☆597Aug 19, 2023Updated 2 years ago
- VoxCeleb plugin for pyannote.database☆30Aug 4, 2021Updated 4 years ago
- ☆114Jan 8, 2021Updated 5 years ago
- A neural network consisting of a CNN and an LSTM for speech enhancement☆25Aug 2, 2018Updated 7 years ago
- ☆66Apr 29, 2021Updated 4 years ago
- Code and generated sounds for "Conditional Sound Generation Using Neural Discrete Time-Frequency Representation Learning", MLSP 2021☆69Sep 3, 2021Updated 4 years ago
- The open source code of ALMTokenizer2: Towards Low bit-rate and Semantic-rich Audio Tokenizer with Flow-based Scalar Diffusion Transforme…☆45Sep 5, 2025Updated 6 months ago
- ☆67Sep 13, 2022Updated 3 years ago
- Fast algorithm for determined blind source separation with update of demixing filters with joint adjustment of the remaining sources.☆35Mar 22, 2021Updated 4 years ago
- Python implementation of the paper "Dynamic Temporal Alignment of Speech to Lips"☆32May 16, 2019Updated 6 years ago
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆251Dec 12, 2025Updated 2 months ago
- Paper, Code and Statistics for Self-Supervised Learning and Pre-Training on Speech.☆211Jan 18, 2024Updated 2 years ago