OSU-slatelab / LibriStutter
A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain
β9Updated 3 years ago
Related projects: β
- A collection of utilities for handling IPA phones.β22Updated 11 months ago
- π₯ π€ The largest clinical study in the world to collect voice data labeled with health information (N>6,000 participants, 48 utterancesβ¦β28Updated 3 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawlerβ24Updated 3 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202β¦β23Updated last year
- Code for the winning solution in the SE&R 2022 Challenge - SER track.β13Updated last year
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.β26Updated last month
- β16Updated 5 years ago
- Zero-Shot Foreign Accent Conversion without a Native Referenceβ27Updated 4 months ago
- asr2kβ48Updated 3 months ago
- Multilingual acoustic word embedding approaches applied and evaluated on GlobalPhone data.β10Updated 3 years ago
- Code for AccentDB.β20Updated 3 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.β13Updated 4 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.β13Updated last year
- β11Updated 2 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Spβ¦β12Updated last year
- Coqui Inference Engineβ38Updated 3 years ago
- β12Updated 3 years ago
- Self-Supervised Speech Pre-training and Representation Learning Toolkit.β8Updated 2 years ago
- Dataset Release for Intent Classification from Speechβ43Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β27Updated 7 months ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic biasβ20Updated 4 years ago
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfacβ¦β43Updated 2 months ago
- Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard feaβ¦β14Updated last year
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.β12Updated 3 years ago
- Implementation of the DIVA model of speech acquisition and production using PyTorchβ20Updated last year
- β42Updated 2 years ago
- Viterbi decoding in PyTorchβ23Updated 3 weeks ago
- Convert words to numbersβ20Updated 2 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IWβ¦β15Updated last year
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?β9Updated 2 months ago