apple / ml-stuttering-events-datasetLinks

☆108

Alternatives and similar repositories for ml-stuttering-events-dataset

Users that are interested in ml-stuttering-events-dataset are comparing it to the libraries listed below

Sorting:

iiscleap / NISP-Dataset
☆30Updated 3 years ago
michen00 / unified_multilingual_dataset_of_emotional_human_utterances
A unified dataset of multilingual emotional human utterances
☆28Updated 3 years ago
luferrer / ConfidenceIntervals
Confidence interval computation for evaluation in machine learning using the bootstrapping approach
☆91Updated last year
hechmik / voxceleb_enrichment_age_gender
Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021
☆70Updated 3 years ago
IDRnD / VoxTube
The VoxTube dataset official repository
☆71Updated last year
RF5 / simple-speaker-embedding
A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.
☆90Updated 8 months ago
Lhx94As / Awesome-Spoken-Language-Identification
An awesome spoken LID repository. (Working in progress
☆108Updated last year
talhanai / speech-nlp-datasets
Contains links to publicly available datasets for modeling health outcomes using speech and language.
☆126Updated last year
felixbur / nkululeko
Machine learning speaker characteristics
☆41Updated this week
ankitapasad / layerwise-analysis
Layer-wise analysis of self-supervised pre-trained speech representations
☆120Updated last year
Xflick / EEND_PyTorch
A PyTorch implementation of End-to-End Neural Diarization
☆109Updated 2 years ago
shangeth / SpeakerProfiling
Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf
☆67Updated 4 years ago
guanlongzhao / fac-via-ppg
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)
☆148Updated 2 years ago
BUTSpeechFIT / AMI-diarization-setup
☆54Updated 2 years ago
xinjli / transphone
phoneme tokenizer and grapheme-to-phoneme model for 8k languages
☆172Updated 2 years ago
joonson / voxconverse
Spot the conversation: speaker diarisation in the wild
☆156Updated 3 years ago
vocaliodmiku / wav2vec2mdd
End-to-End Mispronunciation Detection via wav2vec2.0
☆49Updated 4 years ago
BUTSpeechFIT / VBx
Variational Bayes HMM over x-vectors diarization
☆278Updated last year
dobby-seo / Wav2Keyword
Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.
☆109Updated 2 years ago
Voice-Privacy-Challenge / Voice-Privacy-Challenge-2024
Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software
☆61Updated 10 months ago
vocaliodmiku / wav2vec2mdd-Text
☆19Updated 3 years ago
nikvaessen / w2v2-speaker
Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053
☆145Updated 3 years ago
khanld / ASR-Wav2vec-Finetune
Finetune Wa2vec 2.0 For Speech Recognition
☆142Updated 10 months ago
drfeinberg / PraatScripts
These are praat scripts I use in my research, implemented in parselmouth for python for use in binder
☆133Updated 4 years ago
zhenghuatan / rVAD
Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …
☆137Updated last year
BenoitWang / Speech_Emotion_Diarization
☆69Updated last year
YuanGongND / vocalsound
Dataset and baseline code for the VocalSound dataset (ICASSP2022).
☆156Updated 3 years ago
lingjzhu / charsiu
Charsiu: A neural phonetic aligner.
☆323Updated 3 years ago
CorentinJ / librispeech-alignments
Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset
☆173Updated 6 years ago
google-research-datasets / cvss
CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus
☆220Updated 3 years ago