yucongzh/online_speaker_diarization

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yucongzh/online_speaker_diarization)

yucongzh / online_speaker_diarization

☆15

Alternatives and similar repositories for online_speaker_diarization

Users that are interested in online_speaker_diarization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Hunterhuan / sphereface2_speaker_verification
View on GitHub
Exploring Binary Classification Loss for Speaker Verification
☆18Jul 18, 2023Updated 3 years ago
qinxiaoyi / Simple-Attention-Module-based-Speaker-Verification-with-Iterative-Noisy-Label-Detection
View on GitHub
☆12Jun 14, 2022Updated 4 years ago
zaocan666 / DyViSE
View on GitHub
Dynamic vision-guided speaker embedding for audio-visual speaker diarization
☆12Jul 5, 2022Updated 4 years ago
wenet-e2e / wesignal
View on GitHub
Production first, nn-based on-device signal processing toolkit.
☆63May 30, 2023Updated 3 years ago
dihardchallenge / dihard3_baseline
View on GitHub
☆30Jul 21, 2022Updated 4 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
MagicHub-io / MagicData-RAMC
View on GitHub
MagicData-RAMC Dataset and Baseline
☆64Sep 13, 2022Updated 3 years ago
SSTC-Challenge / SSTC2024_baseline_system
View on GitHub
☆12Jun 14, 2024Updated 2 years ago
pkufool / simple-wer
View on GitHub
A simple command line tool to calculate WER for ASR.
☆14Updated this week
desh2608 / diarizer
View on GitHub
Clustering-based methods for overlapping diarization
☆84Jan 12, 2024Updated 2 years ago
nttcslab-sp / mamba-diarization
View on GitHub
Official repository for Mamba-based Segmentation Model for Speaker Diarization
☆47May 13, 2025Updated last year
sholokhovalexey / online-speaker-clustering
View on GitHub
[ICASSP'23] Online speaker clustering
☆18Feb 22, 2026Updated 5 months ago
cvqluu / simple_diarizer
View on GitHub
Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code
☆158May 2, 2024Updated 2 years ago
dr-pato / SSGD
View on GitHub
Code of the paper "Low-Latency Speech Separation Guided Diarization for Telephone Conversations"
☆15Dec 22, 2022Updated 3 years ago
doerlbh / MiniVox
View on GitHub
Code for our ACML and INTERSPEECH papers: "Speaker Diarization as a Fully Online Bandit Learning Problem in MiniVox".
☆29Sep 20, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
HHousen / speaker-change-detection
View on GitHub
Speaker change detection using SincNet and an LSTM/Transformer
☆57May 26, 2025Updated last year
jwr1995 / DTCN
View on GitHub
☆19Oct 26, 2023Updated 2 years ago
liutaocode / AwesomeDiarizationDataset
View on GitHub
Both audio-only and audio-visual speaker diarization datasets are listed here.
☆16Feb 22, 2023Updated 3 years ago
Kuray107 / S4ND-U-Net_speech_enhancement
View on GitHub
☆33May 17, 2024Updated 2 years ago
JiJiJiang / ASV-Anti-Spoofing-DADA
View on GitHub
Dual-Adversarial Domain Adaptation for replay spoofing detection in automatic speaker verification.
☆19Jul 17, 2026Updated last week
FrenchKrab / datasets-pyannote
View on GitHub
Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)
☆15Oct 22, 2025Updated 9 months ago
RicherMans / UIT_Mobile
View on GitHub
Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"
☆24Mar 6, 2023Updated 3 years ago
IDRnD / VoxTube
View on GitHub
The VoxTube dataset official repository
☆71Feb 14, 2024Updated 2 years ago
lucasnewman / vocos-mlx
View on GitHub
Implementation of 'Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis', in MLX
☆24Oct 30, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kamilakesbi / DiarizersLM
View on GitHub
☆15Jul 16, 2024Updated 2 years ago
X-LANCE / BER
View on GitHub
Balanced Error Rate for Speaker Diarization
☆32Feb 28, 2023Updated 3 years ago
DonkeyShot21 / uis-rnn-sml
View on GitHub
A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)
☆61Apr 15, 2020Updated 6 years ago
Andong-Li-speech / G2Net
View on GitHub
The implementation of G2Net, the extension of GaGNet and is in submission to T-ASLP
☆19Apr 27, 2022Updated 4 years ago
liyunlongaaa / NSD-MS2S
View on GitHub
CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…
☆88Jun 17, 2025Updated last year
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
View on GitHub
Using YouTube to prepare a speech recognition dataset for any language
☆10Mar 30, 2021Updated 5 years ago
BUTSpeechFIT / diacorrect
View on GitHub
Error correction back-end for speaker diarization
☆18Sep 26, 2023Updated 2 years ago
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
View on GitHub
☆11Nov 5, 2021Updated 4 years ago
fgnt / graph_pit
View on GitHub
☆42Oct 14, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
desh2608 / dover-lap
View on GitHub
Python package for combining diarization system outputs.
☆94Oct 12, 2023Updated 2 years ago
kimsunwiub / BLOOM-Net
View on GitHub
Source code for "BLOOM-Net: Blockwise Optimization for Masking Networks Toward Scalable and Efficient Speech Enhancement"
☆14Feb 13, 2022Updated 4 years ago
IU-SAIGE / pse
View on GitHub
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23Mar 12, 2023Updated 3 years ago
zyzisyz / mfa_conformer
View on GitHub
☆160Jan 9, 2023Updated 3 years ago
YoungJay0612 / Speech-Simulation-Tools
View on GitHub
语音增强领域的相关数据仿真工具和方法汇总--持续更新
☆45Jul 11, 2024Updated 2 years ago
ASLP-lab / LLaSE-G1
View on GitHub
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
☆47Mar 10, 2025Updated last year
mbzuai-nlp / sttatts
View on GitHub
☆31Oct 29, 2024Updated last year