sarahjuan / iban
☆11Updated 9 years ago
Related projects: ⓘ
- A handy dataset of noises for ASR☆19Updated 5 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated 7 months ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆12Updated 3 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated last month
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated last year
- ☆11Updated 2 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 3 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆14Updated 2 years ago
- ☆16Updated 3 years ago
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper.☆16Updated 6 months ago
- ☆10Updated 11 months ago
- phone inventory library☆14Updated last year
- A library of speech gadgets.☆13Updated last year
- ☆22Updated 3 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW…☆15Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆27Updated last year
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic bias☆20Updated 4 years ago
- ☆9Updated 4 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Unsupervised speech activity detection system.☆11Updated 6 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆15Updated 3 years ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated 6 months ago
- Evaluation of STT models for german language☆15Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆24Updated 3 years ago
- ☆17Updated last year
- ☆10Updated 2 years ago
- Perform the forced decoding with target transcription☆11Updated 6 years ago
- Code for the paper: How Much Context Does My Attention-Based ASR System Need?☆9Updated 2 months ago