xuchenglin28/speaker_extraction

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xuchenglin28/speaker_extraction)

xuchenglin28 / speaker_extraction

target speaker extraction and verification for multi-talker speech

☆211

Alternatives and similar repositories for speaker_extraction

Users that are interested in speaker_extraction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xuchenglin28 / speaker_extraction_SpEx
View on GitHub
multi-scale time domain speaker extraction
☆81Jun 7, 2021Updated 5 years ago
gemengtju / SpEx_Plus
View on GitHub
SpEx+(tied) source code
☆96Jul 6, 2023Updated 3 years ago
xuchenglin28 / speech_separation
View on GitHub
Constrained Permutation Invariant Training, Speech Separation
☆52Jan 24, 2021Updated 5 years ago
gemengtju / L-SpEx
View on GitHub
☆39Feb 23, 2022Updated 4 years ago
haoxiangsnr / SpEx
View on GitHub
Implementation of "SpEx: Multi-Scale Time Domain Speaker Extraction Network".
☆37Jul 19, 2020Updated 6 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
xuchenglin28 / target_speaker_verification
View on GitHub
target speaker verification (tSV), ts-vector, universal speaker verification for single- and multi-talker speech
☆15Jan 26, 2021Updated 5 years ago
BUTSpeechFIT / speakerbeam
View on GitHub
☆145Oct 25, 2021Updated 4 years ago
mborsdorf / UniversalSpeakerExtraction
View on GitHub
☆15Sep 6, 2021Updated 4 years ago
JorisCos / LibriMix
View on GitHub
An open source dataset for source separation
☆502Feb 9, 2024Updated 2 years ago
yluo42 / TAC
View on GitHub
transform-average-concatenate (TAC) method for end-to-end microphone permutation and number invariant ad-hoc beamforming.
☆311Jun 15, 2021Updated 5 years ago
gemengtju / Tutorial_Separation
View on GitHub
This repo summarizes the tutorials, datasets, papers, codes and tools for speech separation and speaker extraction task. You are kindly i…
☆484Jan 9, 2021Updated 5 years ago
jyhan03 / channel-decorrelation
View on GitHub
multi-channel target speech extraction with channel decorrelation and target speaker adaptation
☆27Feb 19, 2021Updated 5 years ago
JusperLee / Speech-Separation-Paper-Tutorial
View on GitHub
A must-read paper for speech separation based on neural networks
☆952Aug 11, 2025Updated 11 months ago
zexupan / USEV
View on GitHub
☆14Jul 1, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
zexupan / avse_hybrid_loss
View on GitHub
☆16Jun 15, 2022Updated 4 years ago
JusperLee / Dual-Path-RNN-Pytorch
View on GitHub
Dual-path RNN: efficient long sequence modeling for time-domain single-channel speech separation implemented by Pytorch
☆468Feb 14, 2023Updated 3 years ago
maum-ai / voicefilter
View on GitHub
Unofficial PyTorch implementation of Google AI's VoiceFilter system
☆1,214Jul 25, 2024Updated 2 years ago
tencent-ailab / FRA-RIR
View on GitHub
☆214Dec 4, 2023Updated 2 years ago
Enny1991 / beamformers
View on GitHub
Easy to use Beamformers for multi-channel speech separation/enhancement
☆216Jan 26, 2021Updated 5 years ago
chenzhuo1011 / libri_css
View on GitHub
Libri-CSS: dataset and evaluation pipeline
☆157Jan 18, 2023Updated 3 years ago
fgnt / sms_wsj
View on GitHub
SMS-WSJ: Spatialized Multi-Speaker Wall Street Journal database for multi-channel source separation and recognition
☆131Jun 7, 2024Updated 2 years ago
kaituoxu / Conv-TasNet
View on GitHub
A PyTorch implementation of Conv-TasNet described in "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" with Permuta…
☆771Apr 6, 2023Updated 3 years ago
wenet-e2e / wesep
View on GitHub
Target Speaker Extraction Toolkit
☆300Oct 4, 2025Updated 9 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AkojimaSLP / Beamforming-for-speech-enhancement
View on GitHub
simple delaysum, MVDR and CGMM-MVDR
☆288Jan 19, 2019Updated 7 years ago
DavidDiazGuerra / gpuRIR
View on GitHub
Python library for Room Impulse Response (RIR) simulation with GPU acceleration
☆607Jul 18, 2025Updated last year
Edresson / VoiceSplit
View on GitHub
VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram
☆271Jul 25, 2024Updated 2 years ago
lin9x / AV-Sepformer
View on GitHub
☆65Jun 28, 2023Updated 3 years ago
naplab / Conv-TasNet
View on GitHub
☆337Feb 28, 2020Updated 6 years ago
ShiZiqiang / dual-path-RNNs-DPRNNs-based-speech-separation
View on GitHub
A PyTorch implementation of dual-path RNNs (DPRNNs) based speech separation described in "Dual-path RNN: efficient long sequence modeling…
☆182Aug 5, 2020Updated 5 years ago
HaoFengyuan / X-TF-GridNet
View on GitHub
The implementation of "X-TF-GridNet: A Time-Frequency Domain Target Speaker Extraction Network with Adaptive Speaker Embedding Fusion", w…
☆115Sep 2, 2025Updated 10 months ago
fgnt / pb_chime5
View on GitHub
Speech enhancement system for the CHiME-5 dinner party scenario
☆111Feb 6, 2025Updated last year
Andong-Li-speech / EaBNet
View on GitHub
This is the repo of the manuscript "Embedding and Beamforming: All-Neural Causal Beamformer for Multichannel Speech Enhancement", which w…
☆107Jun 10, 2022Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
popcornell / SparseLibriMix
View on GitHub
☆73Feb 15, 2021Updated 5 years ago
Sanyuan-Chen / CSS_with_Conformer
View on GitHub
Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.
☆120Mar 18, 2023Updated 3 years ago
funcwj / conv-tasnet
View on GitHub
A PyTorch implementation of "TasNet: Surpassing Ideal Time-Frequency Masking for Speech Separation" (see recipes in aps framework https:/…
☆219Jul 6, 2023Updated 3 years ago
aliutkus / speechmetrics
View on GitHub
A wrapper around speech quality metrics MOSNet, BSSEval, STOI, PESQ, SRMR, SISDR
☆1,050Jul 5, 2023Updated 3 years ago
ujscjj / DPTNet
View on GitHub
☆119Jan 8, 2021Updated 5 years ago
hangtingchen / Beam-Guided-TasNet
View on GitHub
Beam-guided TasNet
☆58Sep 8, 2022Updated 3 years ago
urgent-challenge / urgent2024_challenge
View on GitHub
Official data preparation scripts for the URGENT 2024 Challenge
☆90May 21, 2025Updated last year