OSU-slatelab / LibriStutter
A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain
☆11 Updated 4 years ago
Alternatives and similar repositories for LibriStutter
Users who are interested in LibriStutter are comparing it to the repositories listed below.
- Code for the paper: How Much Context Does My Attention-Based ASR System Need? ☆11 Updated last week
- Feature extractor for DL speech processing. ☆66 Updated 3 years ago
- A collection of utilities for handling IPA phones. ☆26 Updated 2 years ago
- asr2k ☆52 Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories. ☆27 Updated 10 months ago
- ☆12 Updated 4 years ago
- ☆17 Updated 2 years ago
- ☆40 Updated 3 years ago
- Code repository for the paper "Improving End-to-End SLU performance with Prosodic Attention and Distillation" accepted at Interspeech 202… ☆27 Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆106 Updated 2 years ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix (work accepted at I… ☆18 Updated 2 years ago
- Simple Python package for fast DER computation ☆35 Updated 2 years ago
- ☆56 Updated 3 years ago
- Pypi installable TDNN and TDNN-F layers for PyTorch based acoustic model training ☆41 Updated 5 years ago
- Python package of MP-SENet from Explicit Estimation of Magnitude and Phase Spectra in Parallel for High-Quality Speech Enhancement. ☆21 Updated last year
- ☆37 Updated 2 months ago
- Grapheme to phoneme model for PyTorch ☆42 Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated last year
- Phoneme segmentation using pre-trained speech models ☆55 Updated 3 years ago
- Balanced Error Rate for Speaker Diarization ☆33 Updated 2 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files ☆45 Updated 3 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models. ☆80 Updated 2 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆16 Updated 2 years ago
- Clustering-based methods for overlapping diarization ☆82 Updated 2 years ago
- Forced Alignments for Common Voice ☆32 Updated 5 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model. ☆74 Updated 5 years ago
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM. ☆37 Updated last year
- phone inventory library ☆17 Updated 2 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone … ☆43 Updated 3 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit. ☆52 Updated 3 years ago