AssemblyAI / kaldi-asr-tutorial
Repo for hosting tutorial code associated with the Kaldi Speech Recognition for Beginners - A Simple Tutorial blog by AssemblyAI
☆13 Updated 2 years ago
Alternatives and similar repositories for kaldi-asr-tutorial
Users that are interested in kaldi-asr-tutorial are comparing it to the libraries listed below
- Predicts the level of noise and reverberation on your audiofiles ☆177 Updated 7 months ago
- Toy example to illustrate how to use kaldi recipes. ☆13 Updated 4 years ago
- Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - P… ☆213 Updated 6 months ago
- Matlab and Python libraries for an unsupervised method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsupervised … ☆138 Updated 2 years ago
- speech recognition using Kaldi framework ☆12 Updated 6 years ago
- iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform ☆271 Updated 6 months ago
- Wav2Keyword is keyword spotting (KWS) based on Wav2Vec 2.0. This model shows state-of-the-art performance on the Speech Commands dataset V1 and V2. ☆109 Updated 3 years ago
- Python implementation of the SRMR toolbox ☆126 Updated last year
- Confidence interval computation for evaluation in machine learning using the bootstrapping approach ☆95 Updated last year
- ☆46 Updated 2 years ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv… ☆151 Updated 8 months ago
- Clustering-based methods for overlapping diarization ☆82 Updated 2 years ago
- Collection of pretrained models for the Montreal Forced Aligner ☆185 Updated 4 months ago
- Spot the conversation: speaker diarisation in the wild ☆157 Updated 3 years ago
- Introduction to Speech Processing ☆113 Updated 3 months ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup ☆79 Updated 7 months ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus ☆184 Updated last year
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024 ☆34 Updated 3 weeks ago
- Variational Bayes HMM over x-vectors diarization ☆283 Updated 2 years ago
- Implementation of audio degradation processes ☆105 Updated 10 years ago
- A Python module for interacting with Praat TextGrid files. Also includes a class for reading HTK .mlf files into Praat ☆297 Updated 2 years ago
- An online speech recognition extension toolkit of Kaldi ☆56 Updated 4 years ago
- Weighted Cross-entropy for Low-Resource Languages in Multilingual Speech Recognition ☆15 Updated last year
- ☆53 Updated 2 years ago
- Target Speaker Extraction Toolkit ☆244 Updated 4 months ago
- Keras-based python framework to compute phonological posterior probabilities from audio files ☆46 Updated 3 years ago
- Phoneme segmentation using pre-trained speech models ☆55 Updated 3 years ago
- ☆37 Updated 2 months ago
- ☆91 Updated 9 months ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition ☆93 Updated 5 years ago