MLSpeech / ssl_diarization

Self-supervised Speaker Diarization Interspeech 2022 Implementation

☆8

Alternatives and similar repositories for ssl_diarization

Users who are interested in ssl_diarization are comparing it to the libraries listed below.

mispchallenge / misp2022_baseline
☆30 Updated 2 years ago
sinhat98 / adapter-wavlm
☆43 Updated 2 years ago
NaoyukiKanda / LibriSpeechMix
☆36 Updated 4 years ago
Maokui-He / NSD-MA-MSE
A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"
☆57 Updated 10 months ago
Hunterhuan / sphereface2_speaker_verification
Exploring Binary Classification Loss for Speaker Verification
☆17 Updated 2 years ago
VoxBlink / ScriptsForVoxBlink
A repo containing download guidance and corresponding scripts of the VoxBlink dataset.
☆28 Updated last year
khhungg / BSSE-SE
Boosting Self-Supervised Embeddings for Speech Enhancement
☆47 Updated 3 years ago
lin9x / AV-Sepformer
☆55 Updated 2 years ago
zexupan / MuSE
☆39 Updated 8 months ago
mkunes / w2v2_audioFrameClassification
wav2vec2 audio classification for prosodic boundary detection and other tasks
☆42 Updated last year
nttcslab-sp / EEND-vector-clustering
This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…
☆77 Updated 2 years ago
Audio-WestlakeU / UMA-ASR
This repository is the official implementation of unimodal aggregation (UMA) for automatic speech recognition (ASR).
☆28 Updated 7 months ago
cogmhear / avse_challenge
COG-MHEAR Audio-Visual Speech Enhancement Challenge
☆41 Updated 2 months ago
JunyiPeng00 / SLT22_MultiHead-Factorized-Attentive-Pooling
An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification
☆22 Updated 10 months ago
BUTSpeechFIT / EEND
☆86 Updated 3 months ago
chaufanglin / Normal2Whisper
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆11 Updated 9 months ago
nttcslab-sp / mamba-diarization
Official repository for Mamba-based Segmentation Model for Speaker Diarization
☆37 Updated 2 months ago
mispchallenge / MISP-2023-Challenge-Baseline
☆25 Updated last year
merlresearch / tssep
TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings
☆33 Updated 10 months ago
sky1456723 / Pytorch-MBNet
A pytorch implementation of MBNET: MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK
☆60 Updated 3 years ago
leibniz-future-lab / SelfDistill-SER
☆19 Updated 2 years ago
HuangZiliAndy / SSL_for_multitalker
ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS
☆30 Updated 2 years ago
wyw97 / DENSE
ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction
☆3 Updated 4 months ago
Sanyuan-Chen / CSS_with_Conformer
Code for the ICASSP-2021 paper: Continuous Speech Separation with Conformer.
☆118 Updated 2 years ago
mubingshen / MLC-SLM-Baseline
The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…
☆39 Updated 2 months ago
Lhx94As / PHO-LID
PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification
☆21 Updated last year
YChenL / DS-TDNN
Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch
☆41 Updated last year
TaoRuijie / MFV-KSD
Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)
☆20 Updated last year
BUTSpeechFIT / EEND_dataprep
☆57 Updated 4 months ago
soumimaiti / speechlmscore_tool
☆32 Updated 8 months ago