sholokhovalexey / online-speaker-clusteringLinks
☆17Updated 2 years ago
Alternatives and similar repositories for online-speaker-clustering
Users that are interested in online-speaker-clustering are comparing it to the libraries listed below
Sorting:
- ☆19Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Updated 2 years ago
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 2 years ago
- ☆14Updated 3 years ago
- A simple command line tool to calculate WER for ASR.☆14Updated 10 months ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆29Updated last year
- ☆19Updated 11 months ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆17Updated 10 months ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆23Updated 6 months ago
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 6 months ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆22Updated 11 months ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- CDER (Conversational Diarization Error Rate) Scoring Tool☆21Updated 2 years ago
- Baseline kaldi script for UA-SPEECH corpus☆31Updated 10 months ago
- ☆30Updated 2 years ago
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆24Updated 5 months ago
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆23Updated 9 months ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated 3 months ago
- acnn for text-independent speaker recognition☆10Updated 3 years ago
- Whisper Speech Quality Assessment (WhiSQA)☆15Updated 9 months ago
- Discriminative Training of VBx Diarization☆26Updated 11 months ago
- ☆13Updated 9 months ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆41Updated 2 years ago
- Balanced Error Rate for Speaker Diarization☆32Updated 2 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆33Updated 4 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆40Updated last year
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Updated 2 years ago
- ☆24Updated last year
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆67Updated 3 months ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆54Updated 2 years ago