Beilong-Tang/lauraTSE_code

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Beilong-Tang/lauraTSE_code)

Beilong-Tang / lauraTSE_code

Official Implementation of LauraTSE: Target Speaker Extraction using Auto-Regressive Decoder-Only Language Models.

☆37

Alternatives and similar repositories for lauraTSE_code

Users that are interested in lauraTSE_code are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

IiuZiKai / Evo_TSE
View on GitHub
☆17Apr 9, 2026Updated 3 months ago
ZBang / USEF-TSE
View on GitHub
☆70Jul 5, 2025Updated last year
xu-shitong / TSE-through-Positive-Negative-Enroll
View on GitHub
Official implementation of paper "Target Speaker Extraction through Comparing Noisy Positive and Negative Audio Enrollments"
☆20Updated this week
HaoFengyuan / X-TF-GridNet
View on GitHub
The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…
☆114Sep 2, 2025Updated 10 months ago
aleXiehta / AD-FlowTSE
View on GitHub
Adaptive Flow-Matching for Target Speaker Extraction
☆39Jul 13, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zexupan / USEV
View on GitHub
☆14Jul 1, 2024Updated 2 years ago
zexupan / avse_hybrid_loss
View on GitHub
☆16Jun 15, 2022Updated 4 years ago
wenet-e2e / wesep
View on GitHub
Target Speaker Extraction Toolkit
☆299Oct 4, 2025Updated 9 months ago
hyyan2k / PGUSE
View on GitHub
This is the official implementation of PGUSE
☆41Jun 7, 2025Updated last year
REAL-TSE / wesep-real-tse
View on GitHub
☆36Apr 14, 2026Updated 3 months ago
Aisaka0v0 / CLAPSep
View on GitHub
Query-conditioned target sound extraction model
☆30Mar 25, 2025Updated last year
ahmadikalkhorani / AVCrossNet
View on GitHub
☆16Jul 4, 2024Updated 2 years ago
urgent-challenge / urgent2026_challenge_track1
View on GitHub
Official baseline, dataset and evaluation scripts for the ICASSP 2026 URGENT challenge.
☆36Nov 12, 2025Updated 8 months ago
aispeech-lab / TinyWASE
View on GitHub
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11Jun 28, 2021Updated 5 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
itsnotacie / AAAI-26_SepPrune
View on GitHub
SepPrune: Structured Pruning for Efficient Deep Speech Separation-AAAI'26
☆15May 31, 2025Updated last year
xuchenglin28 / speaker_extraction_SpEx
View on GitHub
multi-scale time domain speaker extraction
☆81Jun 7, 2021Updated 5 years ago
JusperLee / TFACM
View on GitHub
☆23Jul 16, 2025Updated last year
audiosae / audio-sae
View on GitHub
Demo for AudioSAE paper
☆15Apr 26, 2026Updated 2 months ago
juhayna-zh / BSRNN-speech-preprocess
View on GitHub
A solution to denoising and separating for two-speaker-mixed noisy speech, using a BSRNN inspired network.
☆15Aug 22, 2023Updated 2 years ago
mborsdorf / UniversalSpeakerExtraction
View on GitHub
☆15Sep 6, 2021Updated 4 years ago
Audio-WestlakeU / CleanMel
View on GitHub
Pytorch implementation of "CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR".
☆94Feb 2, 2026Updated 5 months ago
YUCHEN005 / Unified-Enhance-Separation
View on GitHub
Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"
☆45Jul 10, 2024Updated 2 years ago
yu-haoyuan / fd-badcat
View on GitHub
fd-sds
☆20Apr 8, 2026Updated 3 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Kevin-naticl / LLaSE-G1
View on GitHub
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
☆105Apr 1, 2025Updated last year
cmots / UniSS
View on GitHub
Official inference code for UniSS: Unified Expressive Speech-to-Speech Translation with Your Voice.
☆31May 30, 2026Updated last month
Beilong-Tang / TSELM
View on GitHub
Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆60Apr 14, 2025Updated last year
ahmadikalkhorani / CrossNet
View on GitHub
☆36Apr 11, 2024Updated 2 years ago
tan90xx / distillw2n
View on GitHub
🤫A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features
☆26Dec 10, 2025Updated 7 months ago
koudounasalkis / voc2vec
View on GitHub
This repository contains the code for the paper "voc2vec: A Foundation Model for Non-Verbal Vocalization", accepted at ICASSP 2025.
☆57Apr 14, 2025Updated last year
ASLP-lab / FastTurn
View on GitHub
☆33May 19, 2026Updated 2 months ago
merlresearch / sebbs
View on GitHub
Prediction of sound event bounding boxes (SEBBs)
☆35Aug 2, 2024Updated last year
ASLP-lab / Smart-Glass-Challenge
View on GitHub
☆17Jun 16, 2026Updated last month
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
WingSingFung / TISDiSS
View on GitHub
Official implementation of TISDiSS, a scalable framework for discriminative source separation.
☆16Oct 19, 2025Updated 9 months ago
JusperLee / Swift-Net
View on GitHub
Power-Guided Grouped SRU for Real-Time Causal Audio-Visual Speech Separation
☆26Updated this week
caoruitju / RUI_SE
View on GitHub
VOICOR: A Residual Iterative Voice Correction Framework for Monaural Speech Enhancement
☆46Sep 12, 2024Updated last year
alibabasglab / MossFormer2
View on GitHub
This is the audio sample repository for speech separation model "MossFormer2".
☆188Nov 28, 2024Updated last year
JusperLee / Look2hear
View on GitHub
A toolkit for researchers in the multimodal sound separation.
☆16Oct 20, 2023Updated 2 years ago
ASLP-lab / Speaker-Reasoner
View on GitHub
Speaker-Reasoner: Scaling Interaction Turns and Reasoning Patterns for Timestamped Speaker-Attributed ASR
☆93May 13, 2026Updated 2 months ago
ASLP-lab / OSUM-Pangu
View on GitHub
An Open-Source Multidimension Speech Understanding Foundation Model Built upon OpenPangu on Ascend NPUs
☆33Mar 15, 2026Updated 4 months ago