cadia-lvl / kaldi-speaker-diarization
This repository creates speaker diarization recipes to be used within the egs folder of kaldi.
☆13Updated last month
Related projects: ⓘ
- Online streaming speaker change detection model in Pytorch☆34Updated last year
- Clustering-based methods for overlapping diarization☆68Updated 8 months ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Updated 2 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆19Updated this week
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆15Updated 3 years ago
- ☆48Updated 11 months ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- Python package for combining diarization system outputs.☆73Updated 11 months ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆63Updated 2 years ago
- MultiSV: scripts for data preparation☆24Updated 3 months ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆24Updated 2 months ago
- ☆27Updated 3 years ago
- Discriminative Training of VBx Diarization☆17Updated 7 months ago
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆12Updated 3 months ago
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023)☆35Updated 3 months ago
- Discriminative Condition-Aware PLDA☆42Updated last month
- ☆31Updated 2 weeks ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆18Updated last year
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training☆38Updated 3 years ago
- Score calibration for speaker verification☆23Updated 4 years ago
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020☆22Updated 3 years ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆42Updated this week
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- A set of audio augmentation techniques to perform noise insertion in datasets used for Automatic Speech Recognition.☆27Updated 2 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆24Updated last year
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆31Updated last year
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆60Updated 6 months ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 4 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 3 years ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year