andrasschin / SepMamba
☆31Updated last month
Alternatives and similar repositories for SepMamba:
Users that are interested in SepMamba are comparing it to the libraries listed below
- ☆46Updated 2 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆24Updated 10 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆25Updated 2 months ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆24Updated 4 months ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- This is the official implementation of the LiSenNet☆55Updated 3 months ago
- Prediction of sound event bounding boxes (SEBBs)☆25Updated 6 months ago
- ☆26Updated last year
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"☆13Updated 5 months ago
- BUDDy: Single-Channel Blind Unsupervised Dereverberation with Diffusion Models☆38Updated 4 months ago
- Fully Quantized Neural Networks For Speech Enhancement☆61Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆37Updated 3 months ago
- Official PyTorch implementation of "RVAE-EM: Generative speech dereverberation based on recurrent variational auto-encoder and convolutiv…☆42Updated 3 weeks ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆45Updated last month
- Generation scripts for EARS-WHAM and EARS-Reverb☆30Updated 5 months ago
- Landing Page for Divide and Remaster v3☆17Updated 7 months ago
- Spherical residual vector quantization (SRVQ)☆28Updated 5 months ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆34Updated 3 months ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆39Updated 6 months ago
- ☆60Updated last year
- Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"☆13Updated 3 years ago
- ☆15Updated 7 months ago
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆41Updated 4 months ago
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆19Updated last year
- ☆26Updated last month
- ☆21Updated last year
- Speech Human Evaluation Estimation Toolkit (SHEET)☆52Updated 3 months ago
- ☆48Updated last year
- ☆20Updated last year
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated 2 weeks ago