felixbur / nkululeko
Machine learning speaker characteristics
☆34 Updated last week
Alternatives and similar repositories for nkululeko
Users who are interested in nkululeko are comparing it to the libraries listed below.
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach ☆83 Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 Updated 2 years ago
- Clustering-based methods for overlapping diarization ☆81 Updated last year
- A list of papers for child ASR ☆42 Updated 7 months ago
- MSP-Podcast Challenge Baseline Code ☆22 Updated 11 months ago
- ☆56 Updated last year
- ☆30 Updated 6 months ago
- This is the implementation of our Interspeech 2022 paper "Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conv… ☆19 Updated last year
- ☆43 Updated 2 years ago
- NOTSOFAR-1 Challenge: Distant Diarization and ASR ☆53 Updated 3 months ago
- Speech Human Evaluation Estimation Toolkit (SHEET) ☆82 Updated this week
- Layer-wise analysis of self-supervised pre-trained speech representations ☆103 Updated 7 months ago
- SA-toolkit: Speaker speech anonymization toolkit in Python ☆23 Updated 2 months ago
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce… ☆23 Updated 2 months ago
- VoicePAT is a modular and efficient toolkit for voice privacy research, with main focus on speaker anonymization. ☆50 Updated last year
- A library built for easier audio self-supervised training and downstream task evaluation ☆118 Updated 9 months ago
- Qualtric or Qualtreat? Generate Qualtrics listening tests for Text-To-Speech evaluations. ☆35 Updated 11 months ago
- Learning differentiable temporal resolution on time-series data. ☆36 Updated 2 years ago
- ☆34 Updated last year
- TS-SEP: Joint Diarization and Separation Conditioned on Estimated Speaker Embeddings ☆33 Updated 8 months ago
- This repository contains the official PyTorch implementation and pre-trained models for the MR-RawNet. ☆16 Updated 11 months ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition" ☆17 Updated 4 years ago
- S3PRL-VC: A Voice Conversion Toolkit based on S3PRL ☆100 Updated 11 months ago
- Official implementation of DNSMOS Pro (accepted at INTERSPEECH 2024). ☆40 Updated 2 months ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech ☆17 Updated last year
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software ☆52 Updated 4 months ago
- An automatic prosodic boundary annotation tool for Text-to-Speech Synthesis (TTS). ☆48 Updated 11 months ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present… ☆25 Updated 2 years ago
- A repo containing download guidance and corresponding scripts of the VoxBlink dataset. ☆26 Updated last year
- (Hybrid) BYOL-S feature extractor using serab-byols package in PyTorch. ☆27 Updated last year