apple / ml-stuttering-events-datasetLinks
☆108Updated last year
Alternatives and similar repositories for ml-stuttering-events-dataset
Users that are interested in ml-stuttering-events-dataset are comparing it to the libraries listed below
Sorting:
- ☆30Updated 3 years ago
- A unified dataset of multilingual emotional human utterances☆28Updated 3 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆91Updated last year
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆70Updated 3 years ago
- The VoxTube dataset official repository☆71Updated last year
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆90Updated 8 months ago
- An awesome spoken LID repository. (Working in progress☆108Updated last year
- Contains links to publicly available datasets for modeling health outcomes using speech and language.☆126Updated last year
- Machine learning speaker characteristics☆41Updated this week
- Layer-wise analysis of self-supervised pre-trained speech representations☆120Updated last year
- A PyTorch implementation of End-to-End Neural Diarization☆109Updated 2 years ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆67Updated 4 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆148Updated 2 years ago
- ☆54Updated 2 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆172Updated 2 years ago
- Spot the conversation: speaker diarisation in the wild☆156Updated 3 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆49Updated 4 years ago
- Variational Bayes HMM over x-vectors diarization☆278Updated last year
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆109Updated 2 years ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆61Updated 10 months ago
- ☆19Updated 3 years ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆145Updated 3 years ago
- Finetune Wa2vec 2.0 For Speech Recognition☆142Updated 10 months ago
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆133Updated 4 years ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆137Updated last year
- ☆69Updated last year
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆156Updated 3 years ago
- Charsiu: A neural phonetic aligner.☆323Updated 3 years ago
- Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset☆173Updated 6 years ago
- CVSS: A Massively Multilingual Speech-to-Speech Translation Corpus☆220Updated 3 years ago