☆110Oct 1, 2024Updated last year
Alternatives and similar repositories for Mamba-TasNet
Users that are interested in Mamba-TasNet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ConMamba for Automatic Speech Recognition☆103Aug 12, 2024Updated last year
- ☆208Dec 5, 2024Updated last year
- This is the official implementation of the SEMamba paper. (Accepted to IEEE SLT 2024)☆256Dec 12, 2025Updated 3 months ago
- Transformer with Local Modeling by Convolution for Speech Separation and Enhancement☆123Aug 8, 2025Updated 7 months ago
- [SLT'24] The official implementation of SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model☆136Nov 5, 2025Updated 4 months ago
- Official repository of SepReformer for speech separation☆250Jan 13, 2025Updated last year
- ☆64Jun 28, 2023Updated 2 years ago
- This is the official implement of Mamba-SEUNet: Mamba UNet for Monaural Speech Enhancement☆91May 26, 2025Updated 9 months ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆82May 21, 2025Updated 10 months ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models☆57Apr 14, 2025Updated 11 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh…☆54Mar 5, 2025Updated last year
- ☆67Aug 16, 2023Updated 2 years ago
- TIGER: Time-frequency Interleaved Gain Extraction and Reconstruction for Efficient Speech Separation☆397Oct 6, 2025Updated 5 months ago
- Single channel speech source separation by diffusion process (ICASSP 2023)☆126Mar 15, 2024Updated 2 years ago
- ☆22Jul 16, 2025Updated 8 months ago
- ☆21Jul 15, 2024Updated last year
- offical code for Dense-TSNet☆12Sep 17, 2024Updated last year
- Perceptual Contrast Stretching on Target Feature for Speech Enhancement (Accepted by INTERSPEECH 2022)☆73May 11, 2024Updated last year
- The official repo of NBC & SpatialNet for multichannel speech separation, denoising, and dereverberation☆339Jan 1, 2025Updated last year
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement☆102Apr 1, 2025Updated 11 months ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…☆98Sep 2, 2025Updated 6 months ago
- An unofficial implementation of Lite-RTSE, a cost-effective lite model for real-time speech enhancement☆14Nov 19, 2023Updated 2 years ago
- Implementation of SpatialCodec.☆69Sep 23, 2023Updated 2 years ago
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆26Feb 25, 2026Updated 3 weeks ago
- ☆54Jul 1, 2024Updated last year
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆54May 26, 2025Updated 9 months ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction☆47Nov 19, 2024Updated last year
- Model configurations for scaling SE models in the paper "Beyond Performance Plateaus: A Comprehensive Study on Scalability in Speech Enha…☆38Aug 7, 2024Updated last year
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆114Jan 28, 2026Updated last month
- Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement☆475May 19, 2025Updated 10 months ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Aug 2, 2025Updated 7 months ago
- ☆91Jun 9, 2024Updated last year
- DOSE: Diffusion Dropout with Adaptive Prior for Speech Enhancement, Conference on Neural Information Processing Systems (NeurIPS), 2023☆59May 16, 2025Updated 10 months ago
- Official code release for "RTFS-Net: Recurrent time-frequency modelling for efficient audio-visual speech separation", accepted ICLR 2024☆50Oct 14, 2025Updated 5 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes☆58Oct 8, 2025Updated 5 months ago
- Code for Audio-Visual Target Speaker Extraction with Selective Auditory Attention (TASLP)☆30Feb 28, 2025Updated last year
- SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios☆268Jan 22, 2025Updated last year
- An efficient speech separation method☆294Apr 11, 2024Updated last year