Picovoice / speaker-diarization-benchmark
Speaker diarization benchmark framework
☆10 · Updated 9 months ago
Related projects:
- ☆27 · Updated 5 months ago
- ☆48 · Updated 11 months ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 · Updated 3 years ago
- ☆41 · Updated 7 months ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆39 · Updated 2 months ago
- ☆11 · Updated 2 years ago
- Implementation of CTC alignment-based single step non-autoregressive transformer ☆11 · Updated last year
- Filtering and Noise Adding Tool ☆29 · Updated 2 years ago
- Steps to perform text-based speaker diarization with the Kaldi toolkit ☆11 · Updated 5 years ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System". ☆12 · Updated 2 weeks ago
- Pronunciation-assisted Subword Modeling ☆29 · Updated 5 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition" ☆17 · Updated 2 years ago
- A list of papers for child ASR ☆24 · Updated 5 months ago
- ☆30 · Updated 7 months ago
- Clustering-based methods for overlapping diarization ☆68 · Updated 8 months ago
- ☆12 · Updated 2 months ago
- S3PRL for Speech Emotion Recognition (see s3prl > downstream) ☆13 · Updated 3 months ago
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆79 · Updated 5 months ago
- End-to-end diarization loss ☆19 · Updated 3 years ago
- Discriminative Training of VBx Diarization ☆17 · Updated 7 months ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task. ☆19 · Updated this week
- Text frontend for ESPnet TTS recipes ☆32 · Updated 3 years ago
- ☆41 · Updated 10 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆64 · Updated 11 months ago
- ☆22 · Updated 2 years ago
- Error correction back-end for speaker diarization ☆9 · Updated 11 months ago
- Official implementation of "PhonMatchNet: Phoneme-Guided Zero-Shot Keyword Spotting for User-Defined Keywords" (INTERSPEECH 2023) ☆35 · Updated 3 months ago
- Online streaming speaker change detection model in PyTorch ☆34 · Updated last year
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw… ☆23 · Updated 3 weeks ago
- A simple command line tool to calculate WER for ASR. ☆13 · Updated last year