OSU-slatelab / LibriStutter
A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain
☆8Updated 3 years ago
Related projects ⓘ
Alternatives and complementary repositories for LibriStutter
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…☆15Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆16Updated 3 weeks ago
- BurrMill core☆21Updated 3 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units☆13Updated last month
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆11Updated 2 weeks ago
- ☆11Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆13Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆12Updated 3 years ago
- A collection of utilities for handling IPA phones.☆25Updated last year
- ☆12Updated 3 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Y…☆25Updated 5 years ago
- 🏥 🎤 The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterances…☆28Updated 3 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- Docker image and scripts for training finetuned or completely personal Kaldi speech models. Particularly for use with kaldi-active-gramma…☆20Updated 2 years ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.☆10Updated 4 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 3 months ago
- Implementation of the paper "BERTphone: Phonetically-aware Encoder Representations for Utterance-level Speaker and Language Recognition"☆17Updated 3 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆18Updated 8 months ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆13Updated 4 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆21Updated 4 years ago
- Train a fiwGAN or ciwGAN model using your own training data☆13Updated 2 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.☆15Updated 4 years ago
- Grapheme to phoneme model for PyTorch☆40Updated 2 years ago
- ABX and kaldi experiments on speech corpora made easy☆31Updated last month
- ConMamba for Automatic Speech Recognition☆44Updated 3 months ago
- ☆33Updated 3 years ago
- ☆24Updated 6 years ago
- Convert words to numbers☆20Updated 2 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- ☆9Updated 4 years ago