cadia-lvl / kaldi-speaker-diarization
This repository provides speaker diarization recipes for use within the egs folder of Kaldi.
☆16Updated 6 months ago
Alternatives and similar repositories for kaldi-speaker-diarization:
Users who are interested in kaldi-speaker-diarization are comparing it to the repositories listed below.
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆13Updated 2 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆48Updated last week
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆16Updated 3 years ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆14Updated 8 months ago
- Error correction back-end for speaker diarization☆15Updated last year
- Discriminative Training of VBx Diarization☆23Updated 4 months ago
- Python package for combining diarization system outputs.☆86Updated last year
- Online streaming speaker change detection model in Pytorch☆38Updated last year
- Clustering-based methods for overlapping diarization☆75Updated last year
- Scripts to align a given wave file to its transcription using models trained with Kaldi☆32Updated 5 years ago
- Score calibration for speaker verification☆24Updated 5 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆17Updated 2 years ago
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings☆24Updated 4 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Updated 2 years ago
- A list of papers for child ASR☆37Updated 4 months ago
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 4 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆15Updated 4 months ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆75Updated 2 years ago
- Discriminative Condition-Aware PLDA☆43Updated 6 months ago
- Balanced Error Rate for Speaker Diarization☆29Updated last year