Picovoice/speaker-diarization-benchmark

Speaker diarization benchmark framework

☆42

Alternatives and similar repositories for speaker-diarization-benchmark

Users who are interested in speaker-diarization-benchmark are comparing it to the repositories listed below.

joonson / voxconverse
Spot the conversation: speaker diarisation in the wild
☆171 · Jul 26, 2022 · Updated 4 years ago
Picovoice / text-to-speech-benchmark
Text-to-Speech Benchmark
☆26 · Apr 2, 2026 · Updated 3 months ago
Picovoice / falcon
On-device speaker diarization powered by deep learning
☆75 · Updated this week
FrenchKrab / IS2023-powerset-diarization
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
☆96 · Oct 18, 2023 · Updated 2 years ago
X-LANCE / BER
Balanced Error Rate for Speaker Diarization
☆32 · Feb 28, 2023 · Updated 3 years ago
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆32 · Apr 8, 2022 · Updated 4 years ago
ex3ndr / supervoice-hybrid
My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one
☆26 · Aug 5, 2024 · Updated last year
ORI-Muchim / Efficient-Speech
Lightweight Korean TTS Model based on FastSpeech2
☆15 · Mar 4, 2026 · Updated 4 months ago
joonaskalda / PixIT
Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…
☆105 · Jan 10, 2025 · Updated last year
Boot-Error / highlight_butler
Highlight Butler manages highlights in my highlight library
☆11 · Jul 6, 2021 · Updated 5 years ago
Audio-WestlakeU / FS-EEND
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …
☆183 · May 7, 2026 · Updated 2 months ago
pyf98 / DPHuBERT
INTERSPEECH 2023: "DPHuBERT: Joint Distillation and Pruning of Self-Supervised Speech Models"
☆118 · Jan 26, 2024 · Updated 2 years ago
huangruizhe / audio
Data manipulation and transformation for audio signal processing, powered by PyTorch
☆10 · Sep 30, 2024 · Updated last year
Picovoice / voice-activity-benchmark
Voice activity engine benchmark framework
☆23 · Jan 14, 2026 · Updated 6 months ago
bshall / dusted
DUSTED: Spoken-Term Discovery using Discrete Speech Units
☆17 · Oct 2, 2024 · Updated last year
kyegomez / Audio-xLSTMs
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆20 · Updated this week
nryant / dscore
Diarization scoring tools.
☆268 · Apr 8, 2026 · Updated 3 months ago
pyannote / pyannote-database
Reproducible experimental protocols for multimedia (audio, video, text) database
☆120 · Mar 1, 2026 · Updated 4 months ago
BUTSpeechFIT / DiaPer
☆69 · Feb 8, 2024 · Updated 2 years ago
lifeiteng / Aligner-SUPERB
Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark
☆39 · May 7, 2025 · Updated last year
felixbur / Speechalyzer
Label and annotate large numbers of speech data files
☆12 · May 5, 2021 · Updated 5 years ago
egruttadauria98 / SSpaVAlDo
☆37 · Jan 6, 2026 · Updated 6 months ago
ArenAcikgoz / Whisper-Alignment
Forced alignment decoder for Whisper.
☆16 · Mar 13, 2024 · Updated 2 years ago
kjw11 / Speaker-Aware-CTC
Speaker-aware CTC (SACTC) for multi-talker overlapped speech recognition.
☆22 · May 26, 2025 · Updated last year
wenet-e2e / WeSpeech-AI
Open Source Speech/Text Data on AI
☆19 · Sep 13, 2022 · Updated 3 years ago
Edresson / VoiceSplit
VoiceSplit: Targeted Voice Separation by Speaker-Conditioned Spectrogram
☆271 · Jul 25, 2024 · Updated 2 years ago
iamanigeeit / present
☆14 · Aug 19, 2024 · Updated last year
usernamehw / vscode-snippets-in-markdown
Write and keep snippets for VSCode in a markdown file.
☆15 · Jul 23, 2023 · Updated 3 years ago
pengzhendong / audio-pipeline
☆23 · Oct 17, 2024 · Updated last year
BUTSpeechFIT / TS-ASR-Whisper
☆116 · Jun 29, 2026 · Updated last month
SpeechColab / PySpeechColab
A library of speech gadgets.
☆15 · Oct 15, 2022 · Updated 3 years ago
argmaxinc / OpenBench
Open-source reproducible benchmarks from Argmax
☆88 · Updated this week
kyegomez / USM
Implementation of Google's USM speech model in Pytorch
☆36 · Jul 20, 2026 · Updated last week
Vaibhavs10 / on-device-llm-playground
A repo with scripts to test and play around with Facebook's recent llama models! 🤗
☆28 · Jul 25, 2023 · Updated 3 years ago
felixbur / Emofilt
Emofilt is a program to simulate emotional arousal with speech synthesis based on the free-for-non-commercial-use MBROLA synthesis engine…
☆14 · Mar 17, 2022 · Updated 4 years ago
IU-SAIGE / pse
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23 · Mar 12, 2023 · Updated 3 years ago
wonjune-kang / expressive-speech-retrieval
Expressive Speech Retrieval using Natural Language Descriptions of Speaking Style
☆15 · Aug 18, 2025 · Updated 11 months ago
SWivid / AUV
An All-in-One Speech, Sound, Music Codec with Single Nested Codebook
☆28 · Oct 11, 2025 · Updated 9 months ago
desh2608 / spyder
Simple Python package for fast DER computation
☆35 · Jun 29, 2023 · Updated 3 years ago