pyannote / pyannote-metrics

A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems

☆220

Alternatives and similar repositories for pyannote-metrics

Users who are interested in pyannote-metrics are comparing it to the libraries listed below.

nryant / dscore
Diarization scoring tools.
☆255 Updated 2 years ago
nttcslab-sp / kaldiio
A pure Python module for reading and writing Kaldi ark files
☆260 Updated 5 months ago
Jamiroquai88 / VBDiarization
Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data
☆95 Updated 2 years ago
hitachi-speech / EEND
End-to-End Neural Diarization
☆405 Updated 3 years ago
BUTSpeechFIT / VBx
Variational Bayes HMM over x-vectors diarization
☆275 Updated last year
YiwenShaoStephen / pychain
PyTorch implementation of LF-MMI for End-to-end ASR
☆220 Updated 4 years ago
FlorianKrey / DNC
Discriminative Neural Clustering for Speaker Diarisation
☆79 Updated 3 years ago
jzlianglu / pykaldi2
Yet another speech toolkit based on Kaldi and PyTorch
☆174 Updated 5 years ago
joonson / voxconverse
Spot the conversation: speaker diarisation in the wild
☆143 Updated 3 years ago
tango4j / Auto-Tuning-Spectral-Clustering
This repo is for the SPL paper "Auto-Tuning Spectral Clustering for Speaker Diarization Using Normalized Maximum Eigengap"
☆121 Updated 3 years ago
wq2012 / SimpleDER
A lightweight library to compute Diarization Error Rate (DER).
☆60 Updated last year
mycrazycracy / tf-kaldi-speaker
Neural speaker recognition/verification system based on Kaldi and Tensorflow
☆32 Updated 5 years ago
google / speaker-id
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…
☆425 Updated last week
bjfu-ai-institute / speaker-recognition-papers
Share some recent speaker recognition papers and their implementations.
☆90 Updated 5 years ago
zhenghuatan / rVAD
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …
☆136 Updated last year
idiap / acoustic-simulator
Implementation of audio degradation processes
☆103 Updated 9 years ago
KarelVesely84 / kaldi-io-for-python
Python functions for reading Kaldi data formats. Useful for rapid prototyping with Python.
☆377 Updated 2 years ago
funcwj / setk
Tools for Speech Enhancement integrated with Kaldi
☆418 Updated 2 years ago
manojpamk / pytorch_xvectors
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196
☆319 Updated 4 years ago
jinserk / pytorch-asr
ASR with PyTorch
☆139 Updated 6 years ago
DonkeyShot21 / uis-rnn-sml
A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)
☆62 Updated 5 years ago
pyannote / pyannote-database
Reproducible experimental protocols for multimedia (audio, video, text) database
☆107 Updated 6 months ago
pyannote / pyannote-core
Advanced data structures for handling temporal segments with attached labels.
☆114 Updated 6 months ago
k2-fsa / snowfall
Moved to https://github.com/k2-fsa/icefall
☆146 Updated 2 years ago
jefflai108 / pytorch-kaldi-neural-speaker-embeddings
A lightweight neural speaker embeddings extraction based on Kaldi and PyTorch.
☆136 Updated 5 years ago
espnet / interspeech2019-tutorial
INTERSPEECH 2019 Tutorial Materials
☆195 Updated 4 years ago
robmsmt / ASR-Audio-Data-Links
A list of publicly available audio data that anyone can download for ASR or other speech activities
☆220 Updated 4 years ago
RicherMans / GPV
Repository for our Interspeech2020 general-purpose voice activity detection (GPVAD) paper
☆142 Updated 2 years ago
philipperemy / speaker-change-detection
Paper: https://arxiv.org/abs/1702.02285
☆64 Updated 6 years ago
bootphon / shennong
A Python toolbox for speech features extraction
☆164 Updated 2 years ago