cadia-lvl / kaldi-speaker-diarizationLinks
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
☆17Updated last year
Alternatives and similar repositories for kaldi-speaker-diarization
Users that are interested in kaldi-speaker-diarization are comparing it to the libraries listed below
Sorting:
- Python package for combining diarization system outputs.☆92Updated 2 years ago
- Online streaming speaker change detection model in Pytorch☆44Updated 2 years ago
- Clustering-based methods for overlapping diarization☆82Updated 2 years ago
- ☆27Updated 5 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆79Updated 7 months ago
- scripts to align a given wave to its transcription using trained models by Kaldi☆35Updated 6 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Updated 11 months ago
- Word Error Rate Estimation☆15Updated 5 years ago
- ☆66Updated last year
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆33Updated 3 weeks ago
- Keyword spotting and forced alignment in any language☆85Updated 5 months ago
- Simple Python package for fast DER computation☆35Updated 2 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆71Updated 4 years ago
- Pronunciation-assisted Subword Modeling☆31Updated 6 years ago
- ☆53Updated 2 years ago
- MeetEval - A meeting transcription evaluation toolkit☆139Updated this week
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆47Updated 4 years ago
- Properly handle position-dependent phones in a subword lexicon FST☆31Updated 5 years ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆83Updated 7 months ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆58Updated 11 months ago
- Application for viewing Rich Transcription Time Marked (RTTM) files in an interactive way☆48Updated 2 years ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Updated 3 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆23Updated last year
- A better, faster, stronger version of the unbounded interleaved-state recurrent neural network (UIS-RNN)☆62Updated 5 years ago
- The VoxTube dataset official repository☆71Updated last year
- End-to-end spoken language identification out of the box.☆48Updated 5 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Updated 3 weeks ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆93Updated 2 years ago
- ☆17Updated 6 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆50Updated 4 years ago