A toolkit for speaker diarization.
☆406Feb 9, 2026Updated 3 weeks ago
Alternatives and similar repositories for DiariZen
Users who are interested in DiariZen are comparing it to the repositories listed below.
- ☆103Updated this week
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆47May 13, 2025Updated 9 months ago
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings…☆105Jan 10, 2025Updated last year
- ☆67Feb 8, 2024Updated 2 years ago
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆166Dec 12, 2025Updated 2 months ago
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit☆1,218Feb 11, 2026Updated 3 weeks ago
- Some comprehensive papers about speaker diarization☆336May 22, 2025Updated 9 months ago
- Open source inference code for Rev's model☆434Apr 22, 2025Updated 10 months ago
- The official PyTorch implementation of the Interspeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆186Sep 24, 2025Updated 5 months ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆84Jun 17, 2025Updated 8 months ago
- Target Speaker Extraction Toolkit☆247Oct 4, 2025Updated 5 months ago
- Python package for combining diarization system outputs.☆92Oct 12, 2023Updated 2 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆93Oct 18, 2023Updated 2 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆70Updated this week
- ☆92Apr 24, 2025Updated 10 months ago
- ☆36Jan 6, 2026Updated 2 months ago
- ONNX Inference of Pyannote Segmentation☆97Dec 23, 2024Updated last year
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆60Sep 19, 2024Updated last year
- Diarization scoring tools.☆262Mar 28, 2023Updated 2 years ago
- Official repository of SepReformer for speech separation☆246Jan 13, 2025Updated last year
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆13Feb 5, 2025Updated last year
- A Framework for Speech, Language, Audio, Music Processing with Large Language Model☆995Jan 15, 2026Updated last month
- This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"☆125Apr 8, 2022Updated 3 years ago
- LightVoc: An Upsampling-Free GAN Vocoder Based on Conformer and Inverse Short-Time Fourier Transform☆18May 17, 2024Updated last year
- ☆324Jun 14, 2024Updated last year
- This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…☆440Aug 12, 2025Updated 6 months ago
- ☆60Oct 22, 2025Updated 4 months ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆87Dec 20, 2024Updated last year
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆59Feb 12, 2025Updated last year
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Oct 2, 2024Updated last year
- A PyTorch implementation of End-to-End Neural Diarization☆109Jun 19, 2023Updated 2 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization☆2,811Dec 8, 2025Updated 2 months ago
- A neural speech codec based on discrete WavLM representations☆24Aug 28, 2024Updated last year
- Discriminative Training of VBx Diarization☆27Sep 23, 2024Updated last year
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- [EMNLP 2024] ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers☆125Mar 20, 2025Updated 11 months ago
- ☆85Jan 28, 2026Updated last month