sholokhovalexey / online-speaker-clusteringLinks
☆17Updated 2 years ago
Alternatives and similar repositories for online-speaker-clustering
Users that are interested in online-speaker-clustering are comparing it to the libraries listed below
Sorting:
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Updated 3 years ago
- ☆14Updated 3 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Updated 10 months ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆28Updated last year
- ☆18Updated last year
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆12Updated 11 months ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification☆23Updated last year
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆18Updated last year
- ☆32Updated 2 years ago
- ☆14Updated last year
- Official implementation of the APSIPA 2022 paper: Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Updated 3 years ago
- A simple command line tool to calculate WER for ASR.☆14Updated last year
- CDER (Conversational Diarization Error Rate) Scoring Tool☆22Updated 3 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated 2 years ago
- ☆28Updated 2 weeks ago
- ☆17Updated 2 years ago
- acnn for text-independent speaker recognition☆10Updated 3 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆52Updated 6 months ago
- ☆29Updated last year
- Clustering-based methods for overlapping diarization☆82Updated last year
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Updated 4 years ago
- ☆16Updated 2 years ago
- Discriminative Training of VBx Diarization☆26Updated last year
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆10Updated 7 months ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Updated 3 years ago
- Baseline kaldi script for UA-SPEECH corpus☆32Updated last year
- The project is associated with the recently-launched INTERSPEECH 2025 Workshop on Multilingual Conversational Speech Language Model (MLC-…☆48Updated 7 months ago
- Whisper Speech Quality Assessment (WhiSQA)☆16Updated 2 months ago
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Updated 9 months ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Updated 3 years ago