apple / ml-stuttering-events-datasetLinks
☆107Updated last year
Alternatives and similar repositories for ml-stuttering-events-dataset
Users that are interested in ml-stuttering-events-dataset are comparing it to the libraries listed below
Sorting:
- ☆30Updated 3 years ago
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach☆89Updated last year
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages☆170Updated 2 years ago
- Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams (Interspeech'19)☆145Updated 2 years ago
- End-to-End Mispronunciation Detection via wav2vec2.0☆48Updated 3 years ago
- A PyTorch implementation of End-to-End Neural Diarization☆108Updated 2 years ago
- Machine learning speaker characteristics☆41Updated last week
- A speaker embedding network in Pytorch that is very quick to set up and use for whatever purposes.☆91Updated 6 months ago
- Predicts the level of noise and reverberation on your audiofiles☆164Updated 3 months ago
- An awesome spoken LID repository. (Working in progress☆106Updated last year
- Various speech datasets made available to the public☆131Updated 9 months ago
- Collection of pretrained models for the Montreal Forced Aligner☆167Updated last week
- Layer-wise analysis of self-supervised pre-trained speech representations☆117Updated 11 months ago
- Word alignments generated by the Montreal Forced Aligner for the Librispeech dataset☆170Updated 6 years ago
- Code and data repository for paper "VoxCeleb enrichment for Age and Gender recognition" submitted at ASRU 2021☆69Updated 3 years ago
- Baseline Recipe for VoicePrivacy Challenge 2024: anonymization systems and evaluation software☆59Updated 8 months ago
- Linguistic processing for Common Voice☆57Updated last year
- ☆67Updated 3 months ago
- A unified dataset of multilingual emotional human utterances☆28Updated 3 years ago
- ☆54Updated last year
- Charsiu: A neural phonetic aligner.☆315Updated 3 years ago
- These are praat scripts I use in my research, implemented in parselmouth for python for use in binder☆132Updated 3 years ago
- Research code for the paper "Fine-tuning wav2vec2 for speaker recognition" found at https://arxiv.org/abs/2109.15053☆145Updated 3 years ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised …☆137Updated last year
- Variational Bayes HMM over x-vectors diarization☆275Updated last year
- ☆18Updated 3 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆76Updated 3 years ago
- Spot the conversation: speaker diarisation in the wild☆147Updated 3 years ago
- Wav2Keyword is keyword spotting(KWS) based on Wav2Vec 2.0. This model shows state-of-the-art in Speech commands dataset V1 and V2.☆108Updated 2 years ago
- INTERSPEECH 23 - Refunction Whisper to recognize new tasks with adapters!☆43Updated 2 years ago