cadia-lvl / kaldi-speaker-diarization
This repository provides speaker diarization recipes for use within the egs folder of Kaldi.
☆16Updated 7 months ago
Alternatives and similar repositories for kaldi-speaker-diarization:
Users who are interested in kaldi-speaker-diarization are comparing it to the libraries listed below:
- Online streaming speaker change detection model in PyTorch☆38Updated last year
- Python package for combining diarization system outputs.☆87Updated last year
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆76Updated 10 months ago
- A simple package for Guided source separation (GSS)☆118Updated 10 months ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆51Updated last month
- ☆54Updated last year
- Clustering-based methods for overlapping diarization☆80Updated last year
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆68Updated 7 months ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆21Updated last month
- End-to-End Mispronunciation Detection via wav2vec2.0☆43Updated 3 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆66Updated 3 years ago
- ☆26Updated last month
- Keyword spotting and forced alignment in any language☆54Updated 9 months ago
- The official PyTorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based …☆126Updated 3 weeks ago
- A list of papers for child ASR☆38Updated 5 months ago
- ☆43Updated last year
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024☆19Updated 4 months ago
- ☆35Updated 3 weeks ago
- ☆53Updated this week
- Python wrappers for Kaldi's Levenshtein distance and alignment code.☆63Updated last year
- ☆46Updated 4 years ago
- Discriminative Condition-Aware PLDA☆43Updated 8 months ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆40Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer☆48Updated 9 months ago
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way☆40Updated last year
- ☆61Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆82Updated last year
- Script to perform statistical significance test between ASR hypotheses.☆21Updated 7 years ago
- Attention Backend for Automatic Speaker Verification with Multiple Enrollment Utterances☆49Updated 2 years ago