josepatino / pyBKLinks

Speaker diarization python system based on binary key speaker modelling

☆60

Alternatives and similar repositories for pyBK

Users that are interested in pyBK are comparing it to the libraries listed below

Sorting:

Jamiroquai88 / VBDiarization
Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data
☆95Updated 2 years ago
yinruiqing / change_detection
Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networks
☆65Updated 5 years ago
philipperemy / speaker-change-detection
Paper: https://arxiv.org/abs/1702.02285
☆64Updated 6 years ago
FlorianKrey / DNC
Discriminative Neural Clustering for Speaker Diarisation
☆78Updated 3 years ago
sciforce / phones-las
Articulatory features estimation using Listen Attend and Spell architecture.
☆32Updated 5 years ago
vishalshar / SpeakerDiarization_RNN_CNN_LSTM
Speaker Diarization is the problem of separating speakers in an audio. There could be any number of speakers and final result should stat…
☆64Updated 4 years ago
srvk / lm_build
Adapting your own Language Model for Kaldi
☆63Updated 6 years ago
DonkeyShot21 / uis-rnn-sml
A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)
☆62Updated 5 years ago
pyannote / pyannote-metrics
A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems
☆219Updated 5 months ago
py-lidbox / lidbox
End-to-end spoken language identification out of the box.
☆48Updated 4 years ago
faroit / CountNet
Deep Neural Network for Speaker Count Estimation
☆153Updated 4 years ago
funcwj / ge2e-speaker-verification
Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"
☆103Updated 6 years ago
swshon / voxceleb-ivector
Voxceleb1 i-vector based speaker recognition system
☆43Updated 7 years ago
xuchenglin28 / speaker_extraction
target speaker extraction and verification for multi-talker speech
☆180Updated 4 years ago
idiap / acoustic-simulator
Implementation of audio degradation processes
☆103Updated 9 years ago
Appen / UHV-OTS-Speech
A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.
☆101Updated 2 years ago
pyannote / DEPRECATED-pyannote-audio-hub
[deprecated] Pretrained models for pyannote-audio 1.x
☆71Updated 3 years ago
pyannote / pyannote-database
Reproducible experimental protocols for multimedia (audio, video, text) database
☆106Updated 5 months ago
staplesinLA / denoising_DIHARD18
☆60Updated 4 years ago
JRMeyer / multi-task-kaldi
An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…
☆54Updated 5 years ago
swshon / dialectID_e2e
End to End Dialect Identification using Convolutional Neural Network
☆52Updated 5 years ago
wq2012 / SimpleDER
A lightweight library to compute Diarization Error Rate (DER).
☆60Updated last year
alumae / online_speaker_change_detector
Online streaming speaker change detection model in Pytorch
☆41Updated 2 years ago
jefflai108 / pytorch-kaldi-neural-speaker-embeddings
A light weight neural speaker embeddings extraction based on Kaldi and PyTorch.
☆136Updated 5 years ago
wangyu09 / exkaldi-rt
An online speech recognition extension toolkit of Kaldi
☆56Updated 4 years ago
tbornt / phoneme_ctc
Bidirectional dynamic RNN + CTC for phoneme recognition
☆46Updated 5 years ago
jcsilva / multilingual-g2p
Multilingual Grapheme to Phoneme
☆50Updated 9 years ago
ynop / audiomate
Python library for handling audio datasets.
☆137Updated 2 years ago
mravanelli / pySpeechRev
This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…
☆95Updated 5 years ago
bootphon / shennong
A Python toolbox for speech features extraction
☆163Updated 2 years ago