ZhihaoDU / du2022sond
Speaker overlap-aware Neural Diarization
☆10Updated last year
Related projects ⓘ
Alternatives and complementary repositories for du2022sond
- ☆16Updated 2 years ago
- Discriminative Training of VBx Diarization☆18Updated last month
- ☆29Updated 2 years ago
- An official implementation of the ICASSP 2023 paper: SG-VAD: Stochastic Gates Based Speech Activity Detection☆23Updated 4 months ago
- Exploring Binary Classification Loss for Speaker Verification☆14Updated last year
- ☆32Updated 3 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆23Updated 7 months ago
- A pytorch implementation of the paper "ANSD-MA-MSE: Adaptive Neural Speaker Diarization Using Memory-Aware Multi-Speaker Embedding"☆45Updated 2 months ago
- Python package for combining diarization system outputs.☆75Updated last year
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆19Updated last year
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆15Updated 3 years ago
- This repository contains a set of codes to run (i.e., train, perform inference with, evaluate) a diarization method called EEND-vector-cl…☆70Updated 2 years ago
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 4 years ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆14Updated 3 months ago
- MultiSV: scripts for data preparation☆25Updated last week
- ☆50Updated last year
- Discriminative Condition-Aware PLDA☆42Updated 3 months ago
- Development Toolkit for the VoxCeleb Speaker Recognition Challenge 2020☆42Updated 4 years ago
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020☆22Updated 4 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch☆26Updated 3 months ago
- ☆49Updated 6 months ago
- ☆32Updated 2 years ago
- The code for the Interspeech paper "Speaker Embedding Extraction with Phonetic Information"☆43Updated 5 years ago
- Attention Backend for Aotumatic Speaker Verification with Multiple Enrollment Utterances☆48Updated 2 years ago
- ADAPTING SELF-SUPERVISED MODELS TO MULTI-TALKER SPEECH RECOGNITION USING SPEAKER EMBEDDINGS☆27Updated last year
- Score calibration for speaker verification☆24Updated 4 years ago
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Updated last year
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆14Updated 5 months ago