pyannote/AMI-diarization-setup

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pyannote/AMI-diarization-setup)

pyannote / AMI-diarization-setup

☆47

Alternatives and similar repositories for AMI-diarization-setup

Users that are interested in AMI-diarization-setup are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BUTSpeechFIT / AMI-diarization-setup
View on GitHub
☆54Oct 17, 2023Updated 2 years ago
pyannote / pyannote-database
View on GitHub
Reproducible experimental protocols for multimedia (audio, video, text) database
☆119Mar 1, 2026Updated 4 months ago
wangkenpu / WSJ2WAV
View on GitHub
Convert WSJ sphere format to waveform and do data simulation.
☆16Feb 20, 2020Updated 6 years ago
nryant / dscore
View on GitHub
Diarization scoring tools.
☆267Apr 8, 2026Updated 3 months ago
joonson / voxconverse
View on GitHub
Spot the conversation: speaker diarisation in the wild
☆170Jul 26, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
hitachi-speech / EEND
View on GitHub
End-to-End Neural Diarization
☆435Aug 30, 2021Updated 4 years ago
BUTSpeechFIT / EEND
View on GitHub
☆95Apr 24, 2025Updated last year
kamilakesbi / DiarizersLM
View on GitHub
☆15Jul 16, 2024Updated 2 years ago
juanmc2005 / rttm-viewer
View on GitHub
Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way
☆48Apr 19, 2023Updated 3 years ago
DongKeon / Awesome-Speaker-Diarization
View on GitHub
Some comprehensive papers about speaker diarization
☆367Mar 24, 2026Updated 3 months ago
BUTSpeechFIT / DiaPer
View on GitHub
☆69Feb 8, 2024Updated 2 years ago
liutaocode / DiarizationVisualization
View on GitHub
Visualization tools for audio-only and multi-modal speaker diarization dataset
☆13Oct 27, 2023Updated 2 years ago
yucongzh / online_speaker_diarization
View on GitHub
☆15Jul 11, 2022Updated 4 years ago
ahmedshah1494 / speech_robust_bench
View on GitHub
☆18Apr 24, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
FrenchKrab / IS2023-powerset-diarization
View on GitHub
Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.
☆96Oct 18, 2023Updated 2 years ago
robin1001 / vad
View on GitHub
simple energy vad
☆19Jun 3, 2017Updated 9 years ago
lucasondel / BayesianModels.jl
View on GitHub
Julia package for Probabilistic Canonical Correlation Analysis
☆12Mar 30, 2022Updated 4 years ago
Audio-WestlakeU / FS-EEND
View on GitHub
The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …
☆183May 7, 2026Updated 2 months ago
BUTSpeechFIT / DiariZen
View on GitHub
A toolkit for speaker diarization.
☆500May 29, 2026Updated last month
pyannote / hf-speaker-diarization-3.1
View on GitHub
Mirror of hf.co/pyannote/speaker-diarization-3.1
☆33Jan 7, 2024Updated 2 years ago
BUTSpeechFIT / vae_dolphin
View on GitHub
☆10Jan 26, 2021Updated 5 years ago
jymh / SAP2-ASR
View on GitHub
☆26Jan 23, 2026Updated 5 months ago
BUTSpeechFIT / VBx
View on GitHub
Variational Bayes HMM over x-vectors diarization
☆287Jan 15, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
BUTSpeechFIT / DiCoW
View on GitHub
☆100Jan 28, 2026Updated 5 months ago
wq2012 / SpectralCluster
View on GitHub
Python re-implementation of the (constrained) spectral clustering algorithms used in Google's speaker diarization papers.
☆552Sep 25, 2024Updated last year
wq2012 / SimpleDER
View on GitHub
A lightweight library to compute Diarization Error Rate (DER).
☆62Jan 14, 2026Updated 6 months ago
Lhx94As / Awesome-Spoken-Language-Identification
View on GitHub
An awesome spoken LID repository. (Working in progress
☆109Apr 22, 2024Updated 2 years ago
wq2012 / awesome-diarization
View on GitHub
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆1,885Jul 7, 2026Updated 2 weeks ago
FrenchKrab / datasets-pyannote
View on GitHub
Automatically setup the AISHELL-4 and MSDWild dataset for usage with pyannote-database (and pyannote-audio)
☆15Oct 22, 2025Updated 8 months ago
IU-SAIGE / pse
View on GitHub
Efficient Personalized Speech Enhancement through Self-Supervised Learning
☆23Mar 12, 2023Updated 3 years ago
BUTSpeechFIT / ASR-hybrid-decoding
View on GitHub
☆17Nov 25, 2019Updated 6 years ago
tango4j / Python-Speaker-Diarization
View on GitHub
Python3 code for the IEEE SPL paper "Auto-Tuning Spectral Clustering for SpeakerDiarization Using Normalized Maximum Eigengap"
☆11Apr 6, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
joonaskalda / PixIT
View on GitHub
Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…
☆105Jan 10, 2025Updated last year
dodohow1011 / TS-VAD
View on GitHub
☆55Jan 15, 2021Updated 5 years ago
juanmc2005 / torch-plda
View on GitHub
PyTorch implementation of PLDA as described in https://ravisoji.com/assets/papers/ioffe2006probabilistic.pdf
☆15Oct 16, 2020Updated 5 years ago
phonexiaresearch / VBx-training-recipe
View on GitHub
☆33Mar 11, 2022Updated 4 years ago
dengcunqin / noise-reduction
View on GitHub
noise reduction
☆17Jul 3, 2024Updated 2 years ago
BUTSpeechFIT / diacorrect
View on GitHub
Error correction back-end for speaker diarization
☆18Sep 26, 2023Updated 2 years ago
X-LANCE / MSDWILD
View on GitHub
[INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.
☆65Jan 24, 2024Updated 2 years ago