google/speaker-id

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

☆453

Alternatives and similar repositories for speaker-id

Users who are interested in speaker-id are comparing it to the libraries listed below.

wq2012 / SpectralCluster
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
☆552 · Sep 25, 2024 · Updated last year
wq2012 / awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆1,885 · Jul 7, 2026 · Updated last week
hitachi-speech / EEND
End-to-End Neural Diarization
☆435 · Aug 30, 2021 · Updated 4 years ago
taylorlu / Speaker-Diarization
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
☆501 · Jul 1, 2021 · Updated 5 years ago
nryant / dscore
Diarization scoring tools.
☆267 · Apr 8, 2026 · Updated 3 months ago
BUTSpeechFIT / EEND
☆95 · Apr 24, 2025 · Updated last year
maum-ai / voicefilter
Unofficial PyTorch implementation of Google AI's VoiceFilter system
☆1,214 · Jul 25, 2024 · Updated last year
BUTSpeechFIT / VBx
Variational Bayes HMM over x-vectors diarization
☆286 · Jan 15, 2024 · Updated 2 years ago
DongKeon / Awesome-Speaker-Diarization
Some comprehensive papers about speaker diarization
☆367 · Mar 24, 2026 · Updated 3 months ago
Edresson / VoiceSplit
VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram
☆271 · Jul 25, 2024 · Updated last year
Audio-WestlakeU / FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …
☆183 · May 7, 2026 · Updated 2 months ago
yistLin / dvector
Speaker embedding (d-vector) trained with GE2E loss
☆289 · Jan 8, 2024 · Updated 2 years ago
joonson / voxconverse
Spot the conversation: speaker diarisation in the wild
☆170 · Jul 26, 2022 · Updated 3 years ago
microsoft / UniSpeech
UniSpeech - Large Scale Self-Supervised Learning for Speech
☆486 · Apr 5, 2024 · Updated 2 years ago
wenet-e2e / wespeaker
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
☆1,359 · Jul 8, 2026 · Updated last week
HarryVolek / PyTorch_Speaker_Verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" by Wan, Li et al.
☆598 · Jan 20, 2022 · Updated 4 years ago
BUTSpeechFIT / DiariZen
A toolkit for speaker diarization.
☆500 · May 29, 2026 · Updated last month
BUTSpeechFIT / DiaPer
☆69 · Feb 8, 2024 · Updated 2 years ago
facebookresearch / WavAugment
A library for speech data augmentation in time-domain
☆689 · Aug 30, 2021 · Updated 4 years ago
desh2608 / dover-lap
Python package for combining diarization system outputs.
☆94 · Oct 12, 2023 · Updated 2 years ago
google / uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Su…
☆1,588 · Sep 25, 2024 · Updated last year
Snowdar / asv-subtools
An Open Source Tools for Speaker Recognition
☆638 · Aug 5, 2024 · Updated last year
desh2608 / diarizer
Clustering-based methods for overlapping diarization
☆84 · Jan 12, 2024 · Updated 2 years ago
VITA-Group / AutoSpeech
[InterSpeech 2020] "AutoSpeech: Neural Architecture Search for Speaker Recognition" by Shaojin Ding*, Tianlong Chen*, Xinyu Gong, Weiwei …
☆206 · Dec 8, 2022 · Updated 3 years ago
iver56 / torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,160 · Nov 24, 2025 · Updated 7 months ago
Jungjee / RawNet
Official repository for RawNet, RawNet2, and RawNet3
☆407 · Mar 21, 2024 · Updated 2 years ago
nttcslab-sp / EEND-vector-clustering
This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…
☆81 · Oct 18, 2022 · Updated 3 years ago
manojpamk / pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
☆321 · Nov 11, 2020 · Updated 5 years ago
tango4j / Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
☆125 · Apr 8, 2022 · Updated 4 years ago
huggingface / diarizers
☆327 · Jun 14, 2024 · Updated 2 years ago
xuchenglin28 / speaker_extraction
target speaker extraction and verification for multi-talker speech
☆210 · Jan 24, 2021 · Updated 5 years ago
Xflick / EEND_PyTorch
A PyTorch implementation of End-to-End Neural Diarization
☆110 · Jun 19, 2023 · Updated 3 years ago
joonaskalda / PixIT
Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…
☆105 · Jan 10, 2025 · Updated last year
dodohow1011 / TS-VAD
☆55 · Jan 15, 2021 · Updated 5 years ago
s3prl / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,557 · Mar 12, 2026 · Updated 4 months ago
pyannote / pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…
☆10,302 · Updated this week
wq2012 / SimpleDER
A lightweight library to compute Diarization Error Rate (DER).
☆62 · Jan 14, 2026 · Updated 6 months ago
mravanelli / SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
☆1,240 · Apr 28, 2021 · Updated 5 years ago
fgnt / meeteval
MeetEval - A meeting transcription evaluation toolkit
☆171 · Jan 27, 2026 · Updated 5 months ago