Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.
☆33 · Nov 9, 2025 · Updated 4 months ago
Alternatives and similar repositories for lauraTSE_code
Users who are interested in lauraTSE_code are comparing it to the repositories listed below.
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w… ☆97 · Sep 2, 2025 · Updated 6 months ago
- ☆15 · Jun 15, 2022 · Updated 3 years ago
- Query-conditioned target sound extraction model ☆30 · Mar 25, 2025 · Updated 11 months ago
- Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation ☆24 · Nov 4, 2025 · Updated 4 months ago
- ☆20 · Aug 25, 2025 · Updated 6 months ago
- ☆14 · Jul 1, 2024 · Updated last year
- ☆21 · Jul 16, 2025 · Updated 8 months ago
- ☆46 · Jul 5, 2025 · Updated 8 months ago
- Target Speaker Extraction Toolkit ☆250 · Oct 4, 2025 · Updated 5 months ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models ☆57 · Apr 14, 2025 · Updated 11 months ago
- Official code of SenSE. ☆76 · Oct 30, 2025 · Updated 4 months ago
- Official implementation for FlowSep ☆70 · Jan 2, 2025 · Updated last year
- A toolkit for researchers in the multimodal sound separation. ☆16 · Oct 20, 2023 · Updated 2 years ago
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement ☆102 · Apr 1, 2025 · Updated 11 months ago
- Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation" ☆44 · Jul 10, 2024 · Updated last year
- ☆16 · Jan 11, 2026 · Updated 2 months ago
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement ☆16 · Jul 11, 2025 · Updated 8 months ago
- Official baseline for ICASSP 2026 URGENT Challenge Track 2 (Speech Quality Assessment) ☆28 · Jan 8, 2026 · Updated 2 months ago
- A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network. ☆15 · Aug 22, 2023 · Updated 2 years ago
- ☆15 · Sep 6, 2021 · Updated 4 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and… ☆11 · Jun 28, 2021 · Updated 4 years ago
- multi-scale time domain speaker extraction ☆73 · Jun 7, 2021 · Updated 4 years ago
- A fully and partially fake speech dataset for evaluation ☆14 · Nov 11, 2025 · Updated 4 months ago
- Official baseline, dataset and evaluation scripts for the ICASSP 2026 URGENT challenge. ☆33 · Nov 12, 2025 · Updated 4 months ago
- This is the audio sample repository for speech separation model "MossFormer2". ☆175 · Nov 28, 2024 · Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec ☆46 · May 16, 2025 · Updated 10 months ago
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer. ☆114 · Jan 28, 2026 · Updated last month
- DOSE: Diffusion Dropout with Adaptive Prior for Speech Enhancement, Conference on Neural Information Processing Systems (NeurIPS), 2023 ☆59 · May 16, 2025 · Updated 10 months ago
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer" ☆44 · Oct 30, 2025 · Updated 4 months ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo… ☆20 · Sep 1, 2023 · Updated 2 years ago
- ☆22 · Jul 10, 2025 · Updated 8 months ago
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023 ☆12 · Mar 17, 2024 · Updated 2 years ago
- VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement ☆46 · Sep 12, 2024 · Updated last year
- ☆25 · Aug 29, 2025 · Updated 6 months ago
- ☆24 · Sep 11, 2025 · Updated 6 months ago
- SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios ☆267 · Jan 22, 2025 · Updated last year
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction ☆47 · Nov 19, 2024 · Updated last year
- Official Repository of Paper: "Emilia-NV: A Non-Verbal Speech Dataset with Word-Level Annotation for Human-Like Speech Modeling" ☆86 · Sep 18, 2025 · Updated 6 months ago
- ☆23 · Jul 17, 2024 · Updated last year