JunyiPeng00 / SLT22_MultiHead-Factorized-Attentive-PoolingLinks

An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification

☆22

Alternatives and similar repositories for SLT22_MultiHead-Factorized-Attentive-Pooling

Users that are interested in SLT22_MultiHead-Factorized-Attentive-Pooling are comparing it to the libraries listed below

Sorting:

VoxBlink / ScriptsForVoxBlink
A repo containing download guidance and corresponding scripts of the VoxBlink dataset.
☆28Updated last year
sinhat98 / adapter-wavlm
☆45Updated 2 years ago
YoshikiMas / madeon-asr
[SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition
☆16Updated last year
yzyouzhang / Audio_Research_in_US
Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…
☆27Updated 3 weeks ago
zxzhao0 / C2SER
We propose C2SER, a novel audio-language model designed to enhance the stability and accuracy of speech emotion recognition through conte…
☆39Updated 9 months ago
fgnt / speaker_reassignment
Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment
☆12Updated 10 months ago
Speech-Arena / speech_df_arena
☆27Updated 3 months ago
seongq / flowmse
(ICASSP 2025, official code)FlowSE: Flow Matching-based Speech Enhancement
☆76Updated 4 months ago
mispchallenge / misp2022_baseline
☆31Updated 2 years ago
fcumlin / DNSMOSPro
Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024).
☆66Updated 5 months ago
msplabresearch / MSP-Podcast_Challenge_IS2025
MSP-Podcast Challenge Baseline Code for Interspeech 2025
☆28Updated last year
Maokui-He / NSD-MA-MSE
A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"
☆58Updated last year
ductuantruong / tcm_add
Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect…
☆50Updated last year
sp-uhh / ears_benchmark
Generation scripts for EARS-WHAM and EARS-Reverb
☆41Updated 5 months ago
kaistmm / seed-pytorch
[INTERSPEECH 2025] Official code for "SEED: Speaker Embedding Enhancement Diffusion Model"
☆52Updated last month
hongfeixue / StutteringSpeechChallenge
SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
☆12Updated last year
Hunterhuan / sphereface2_speaker_verification
Exploring Binary Classification Loss for Speaker Verification
☆18Updated 2 years ago
Beilong-Tang / TSELM
Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆50Updated 7 months ago
soumimaiti / speechlmscore_tool
☆32Updated last year
mubingshen / MLC-SLM-Baseline
The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…
☆46Updated 6 months ago
YUCHEN005 / Unified-Enhance-Separation
Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"
☆44Updated last year
lin9x / AV-Sepformer
☆58Updated 2 years ago
YChenL / DS-TDNN
Official implement of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch
☆41Updated 2 years ago
Audio-WestlakeU / UMA-ASR
This repository is the official implementation of unimodal aggregation (UMA) for automaticspeech recognition (ASR).
☆34Updated 11 months ago
zelokuo / VPIDM
This is official repository of new SOTA diffusion models based method for speech enhancement
☆41Updated last year
HuangZiliAndy / SSL_for_multitalker
ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS
☆31Updated 2 years ago
khhungg / BSSE-SE
Boosting Self-Supervised Embeddings for Speech Enhancement
☆47Updated 3 years ago
Yaselley / SSL_Layerwise_Deepfake
SSL Layerwise analysis for speech deepfake detection
☆29Updated 4 months ago
NaoyukiKanda / LibriSpeechMix
☆37Updated 4 years ago
wngh1187 / Diff-SV
Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…
☆23Updated last year