felixbur / nkululekoView external linksLinks
Machine learning speaker characteristics
☆43Updated this week
Alternatives and similar repositories for nkululeko
Users that are interested in nkululeko are comparing it to the libraries listed below
Sorting:
- For further understanding the wide array of emotions embedded in human speech, we are introducing an emotional speech corpus. In contrast…☆11Oct 29, 2018Updated 7 years ago
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆26Oct 5, 2022Updated 3 years ago
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆26Mar 13, 2025Updated 11 months ago
- Official implementation of the paper "Automatic Severity Assessment of Dysarthric speech by using Self-supervised Model with Multi-task L…☆11Feb 14, 2024Updated 2 years ago
- ☆30Aug 9, 2022Updated 3 years ago
- ☆14Updated this week
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- Handling audio files in Python☆39Updated this week
- Pytorch implementation of Transformer-TTS for converting text into speech.☆19Jul 9, 2021Updated 4 years ago
- Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization☆17Dec 16, 2021Updated 4 years ago
- A Kaldi recipe for training automatic speech recognition systems on the Torgo corpus of dysarthric speech☆17Sep 22, 2023Updated 2 years ago
- A fundamental frequency estimation algorithm using features from the magnitude and phase spectrogram.☆24Mar 29, 2021Updated 4 years ago
- ☆20Apr 5, 2021Updated 4 years ago
- A collection of dataset consists of a total of 8 English speech datasets for SER☆30Jan 8, 2025Updated last year
- Layer-wise analysis of self-supervised pre-trained speech representations☆126Oct 18, 2024Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆71Dec 18, 2021Updated 4 years ago
- Python toolkit for likelihood-ratio calibration of binary classifiers☆27Feb 21, 2023Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Dec 16, 2022Updated 3 years ago
- In-the-wild deepfake detection dataset☆13Mar 5, 2025Updated 11 months ago
- Python package for openSMILE☆304Jan 26, 2026Updated 2 weeks ago
- [RAVDESS] Speech Emotion Recognition with Convolutional Attention based Bi-GRU. (Best test accuracy of 87%)☆33Sep 29, 2023Updated 2 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP…☆33Feb 16, 2024Updated last year
- Add Arabic diacritics (tashkeel/harakat) using Rust/Python/C++/WASM and NLP models☆45Oct 4, 2025Updated 4 months ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.☆10Feb 29, 2024Updated last year
- VS Code Extension for Multipass☆10Sep 25, 2024Updated last year
- A fast python library for aligning similar audio snippets passed in as NumPy arrays☆48Oct 27, 2025Updated 3 months ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆96Apr 5, 2024Updated last year
- ☆13Feb 25, 2023Updated 2 years ago
- The electronic Holly Quran browser Elforkane☆11Nov 14, 2021Updated 4 years ago
- Frequency tracking in time-frequency representations☆13Jan 19, 2021Updated 5 years ago
- Pytorch implementation of Generalized End-to-End Loss for speaker verification☆88Apr 23, 2019Updated 6 years ago
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆12Sep 6, 2024Updated last year
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 2 years ago
- ☆10May 13, 2018Updated 7 years ago
- Tool for creating Kaldi nnet3 recipes using the International Phonetic Alphabet (IPA)☆10Jun 2, 2021Updated 4 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆46Dec 27, 2022Updated 3 years ago
- Open-source audio embedding models, submitted to the HEAR 2021 challenge☆11Updated this week
- label and annotate large number of speech data files☆12May 5, 2021Updated 4 years ago
- Time frequency ridge detection based on relevant ridge portions☆11Aug 17, 2023Updated 2 years ago