felixbur / nkululeko
Machine learning speaker characteristics
☆33Updated 2 weeks ago
Alternatives and similar repositories for nkululeko:
Users that are interested in nkululeko are comparing it to the libraries listed below
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆77Updated 11 months ago
- Clustering-based methods for overlapping diarization☆77Updated last year
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆17Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆67Updated 3 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- ☆43Updated 2 years ago
- ☆57Updated 10 months ago
- Python toolkit for speech processing☆68Updated last week
- ☆52Updated last year
- Official repository of NeXt-TDNN for speaker verification☆67Updated 5 months ago
- MSP-Podcast Challenge Baseline Code☆20Updated 9 months ago
- A list of papers for child ASR☆38Updated 5 months ago
- Phoneme segmentation using pre-trained speech models☆55Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- ☆79Updated 6 months ago
- This repository contains official pytorch implementation and pre-trained models for the MR-RawNet.☆14Updated 9 months ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆81Updated last year
- SLT 2024 Challenge: Post-ASR-Speaker-Tagging☆14Updated 8 months ago
- This is the code and dataset repo for Interspeech 2024 paper "Target conversation extraction: Source separation using turn-taking dynamic…☆46Updated 5 months ago
- ☆61Updated last year
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆21Updated last year
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization.☆49Updated 9 months ago
- A simple package for Guided source separation (GSS)☆117Updated 9 months ago
- ☆29Updated 3 months ago
- Multipurpose Multi Speaker Mixture Signal Generator☆44Updated last month
- multilingual speech aligner☆72Updated last year
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset.☆25Updated 10 months ago
- Layer-wise analysis of self-supervised pre-trained speech representations☆101Updated 4 months ago
- ☆63Updated 6 months ago