Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.
☆32 Nov 9, 2025 Updated 3 months ago
Alternatives and similar repositories for lauraTSE_code
Users who are interested in lauraTSE_code compare it to the libraries listed below.
- Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation ☆23 Nov 4, 2025 Updated 3 months ago
- Query-conditioned target sound extraction model ☆30 Mar 25, 2025 Updated 11 months ago
- ☆15 Jun 15, 2022 Updated 3 years ago
- The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w… ☆93 Sep 2, 2025 Updated 5 months ago
- LLaSE: Maximizing Acoustic Preservation for LLaMA based Speech Enhancement ☆16 Jul 11, 2025 Updated 7 months ago
- ☆20 Aug 25, 2025 Updated 6 months ago
- ☆14 Jul 1, 2024 Updated last year
- Official code of SenSE. ☆74 Oct 30, 2025 Updated 4 months ago
- Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models ☆56 Apr 14, 2025 Updated 10 months ago
- Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation" ☆44 Jul 10, 2024 Updated last year
- Official implementation for FlowSep ☆70 Jan 2, 2025 Updated last year
- LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement ☆98 Apr 1, 2025 Updated 11 months ago
- Target Speaker Extraction Toolkit ☆247 Oct 4, 2025 Updated 4 months ago
- Official baseline for ICASSP 2026 URGENT Challenge Track 2 (Speech Quality Assessment) ☆27 Jan 8, 2026 Updated last month
- A toolkit for researchers in the multimodal sound separation. ☆16 Oct 20, 2023 Updated 2 years ago
- Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer" ☆43 Oct 30, 2025 Updated 4 months ago
- ☆44 Jul 5, 2025 Updated 7 months ago
- ☆21 Jul 16, 2025 Updated 7 months ago
- Official baseline, dataset and evaluation scripts for the ICASSP 2026 URGENT challenge. ☆32 Nov 12, 2025 Updated 3 months ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo… ☆20 Sep 1, 2023 Updated 2 years ago
- Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction ☆46 Nov 19, 2024 Updated last year
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec ☆46 May 16, 2025 Updated 9 months ago
- multi-scale time domain speaker extraction ☆71 Jun 7, 2021 Updated 4 years ago
- ☆24 Aug 29, 2025 Updated 6 months ago
- ☆11 Jun 6, 2022 Updated 3 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and… ☆11 Jun 28, 2021 Updated 4 years ago
- Official code release for "TDFNet: An Efficient Audio-Visual Speech Separation Model with Top-down Fusion", accepted ICIST 2023 ☆12 Mar 17, 2024 Updated last year
- ☆68 Dec 30, 2025 Updated 2 months ago
- Official code for MUSE: Flexible Voiceprint Receptive Fields and Multi-Path Fusion Enhanced Taylor Transformer for U-Net-based Speech Enh… ☆54 Mar 5, 2025 Updated 11 months ago
- Llama-Mimi is a speech language model that uses a unified tokenizer (Mimi) and a single Transformer decoder (Llama) to jointly model sequ… ☆27 Sep 20, 2025 Updated 5 months ago
- The code about “LABNet: A Lightweight Attentive Beamforming Network for Ad-hoc Multichannel Microphone Invariant Real-Time Speech Enhance… ☆38 Oct 10, 2025 Updated 4 months ago
- ASLP Summer Inter@NPU ☆12 Jul 30, 2024 Updated last year
- Official implementation of "Video-Foley: Two-Stage Video-To-Sound Generation via Temporal Event Condition For Foley Sound". IEEE TASLP 20… ☆17 Updated this week
- Room impulse response simulation for various array architectures using Monte-Carlo simulation and quaternions (Python) ☆17 Updated this week
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic… ☆55 Aug 15, 2025 Updated 6 months ago
- This is the audio sample repository for speech separation model "MossFormer2". ☆171 Nov 28, 2024 Updated last year
- Da - ECHO - RetrievAl - daTasEt ☆34 Jul 7, 2024 Updated last year
- ☆56 Jan 25, 2026 Updated last month
- ☆11 Oct 14, 2023 Updated 2 years ago