cadia-lvl / kaldi-speaker-diarizationLinks
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
☆17Updated last year
Alternatives and similar repositories for kaldi-speaker-diarization
Users that are interested in kaldi-speaker-diarization are comparing it to the libraries listed below
Sorting:
- Online streaming speaker change detection model in Pytorch☆42Updated 2 years ago
- Python package for combining diarization system outputs.☆88Updated last year
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆23Updated 6 months ago
- Clustering-based methods for overlapping diarization☆81Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆68Updated 3 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆48Updated 3 years ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆74Updated 2 months ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆23Updated 9 months ago
- Balanced Error Rate for Speaker Diarization☆32Updated 2 years ago
- ☆18Updated 3 years ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations.☆36Updated last year
- ☆62Updated last year
- An online speech recognition extension toolkit of Kaldi☆56Updated 4 years ago
- ☆54Updated last year
- NOTSOFAR-1 Challenge: Distant Diarization and ASR☆56Updated 6 months ago
- Pronunciation-assisted Subword Modeling☆31Updated 6 years ago
- ☆27Updated 4 years ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Updated 2 years ago
- PyTorch implementation of RPNSD☆60Updated last year
- A list of papers for child ASR☆46Updated 10 months ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆41Updated 2 years ago
- CHIME-7/8 diarization champion system: neural speaker diarization using memory-aware multi-speaker embedding with sequence-to-sequence ar…☆78Updated 2 months ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆67Updated 3 months ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆45Updated 3 years ago
- The VoxTube dataset official repository☆70Updated last year
- Discriminative Training of VBx Diarization☆26Updated 11 months ago
- MeetEval - A meeting transcription evaluation toolkit☆108Updated last month
- Keyword spotting and forced alignment in any language☆63Updated last week
- Reproducible experimental protocols for multimedia (audio, video, text) database☆107Updated 6 months ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆29Updated last year