sholokhovalexey / online-speaker-clusteringLinks
☆17Updated 2 years ago
Alternatives and similar repositories for online-speaker-clustering
Users that are interested in online-speaker-clustering are comparing it to the libraries listed below
Sorting:
- ☆19Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Updated 2 years ago
- ☆14Updated 3 years ago
- acnn for text-independent speaker recognition☆10Updated 3 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆23Updated 5 months ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 6 months ago
- ☆12Updated 8 months ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆22Updated 10 months ago
- [SLT'24] Mamba-based Decoder-Only Approach for Speech Recognition☆14Updated 8 months ago
- Balanced Error Rate for Speaker Diarization☆32Updated 2 years ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆23Updated 8 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆17Updated 10 months ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated last year
- ☆19Updated 10 months ago
- ☆18Updated 3 months ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆21Updated 2 years ago
- ☆18Updated 3 years ago
- ☆15Updated last year
- Whisper Speech Quality Assessment (WhiSQA)☆12Updated 8 months ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- ☆17Updated last year
- This repository presents an evaluation framework for speech-to-speech (S2S) models, following the methodology described in the EmphAsses …☆22Updated last year
- ☆10Updated last year
- ☆30Updated 2 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆12Updated 4 months ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments.☆17Updated last week
- Official repository for Mamba-based Segmentation Model for Speaker Diarization☆37Updated 2 months ago
- Baseline kaldi script for UA-SPEECH corpus☆30Updated 9 months ago
- Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect…☆42Updated 8 months ago