zszyellow / WER-in-python
This program calculates the word error rate (WER) of an ASR hypothesis and prints the aligned result.
☆154 Updated 5 years ago
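For readers unfamiliar with the metric, WER is commonly computed as the word-level edit distance between reference and hypothesis, divided by the number of reference words. The following is a minimal sketch under that assumption, not the repository's own code; the function name `wer` and its whitespace tokenization are illustrative choices.

```python
def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance / number of reference words.

    Hypothetical helper for illustration; tokenizes on whitespace only.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # d[i][j] = minimum number of edits turning ref[:i] into hyp[:j]
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i  # delete all i reference words
    for j in range(len(hyp) + 1):
        d[0][j] = j  # insert all j hypothesis words

    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # match or substitution
            )
    return d[len(ref)][len(hyp)] / len(ref)


if __name__ == "__main__":
    print(wer("the cat sat on the mat", "the cat sit on mat"))
```

Because insertions count as errors, WER can exceed 1.0 when the hypothesis is much longer than the reference.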
Alternatives and similar repositories for WER-in-python:
Users who are interested in WER-in-python are comparing it to the libraries listed below.
- Yet another speech toolkit based on Kaldi and PyTorch ☆173 Updated 4 years ago
- experiments with RETURNN ☆156 Updated last month
- PyTorch implementation of LF-MMI for End-to-end ASR ☆219 Updated 4 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition ☆126 Updated 5 years ago
- A pure Python module for reading and writing Kaldi ark files ☆252 Updated last year
- ASR with PyTorch ☆140 Updated 5 years ago
- Discriminative Neural Clustering for Speaker Diarisation ☆78 Updated 2 years ago
- ☆273 Updated 4 years ago
- Python module for evaluating ASR hypotheses (e.g. word error rate, word recognition rate). ☆274 Updated last year
- A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework. ☆200 Updated 6 years ago
- Extract xvector and ivector under Kaldi ☆109 Updated 6 years ago
- A list of publicly available audio data that anyone can download for ASR or other speech activities ☆203 Updated 3 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T. ☆140 Updated last year
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks ☆137 Updated 3 years ago
- Moved to https://github.com/k2-fsa/icefall ☆144 Updated 2 years ago
- INTERSPEECH 2019 Tutorial Materials ☆193 Updated 3 years ago
- ☆68 Updated 2 years ago
- A PyTorch wrapper for LF-MMI training and parallel training in Kaldi ☆73 Updated 2 years ago
- Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data ☆96 Updated last year
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196 ☆309 Updated 4 years ago
- Example code for a neural transducer model. ☆61 Updated last year
- Segment a given audio into utterances using a trained end-to-end ASR model. ☆72 Updated 4 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai… ☆54 Updated 5 years ago
- A PyTorch Implementation of Transducer Model for End-to-End Speech Recognition ☆232 Updated 4 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition ☆45 Updated 4 years ago
- SEAME corpus two develop set ☆38 Updated 5 years ago
- Adapting your own Language Model for Kaldi ☆64 Updated 6 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function. ☆122 Updated 4 years ago
- Code for prefix beam search tutorial by @labodk ☆185 Updated 4 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆327 Updated 9 months ago