vshmyhlo / listen-attend-and-speell-pytorch
Implementation of Automatic Speech Recognition inspired by "Listen, Attend and Spell" paper in PyTorch
☆11Updated 4 years ago
Related projects ⓘ
Alternatives and complementary repositories for listen-attend-and-speell-pytorch
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 3 months ago
- Lattice combination algorithm to combine inaccurate transcripts with hypothesis lattices☆16Updated 8 months ago
- WarpRNNT loss ported in Numba CPU/CUDA for Pytorch☆16Updated 2 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Updated 4 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Updated 4 years ago
- This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…☆11Updated 4 years ago
- ☆32Updated 2 months ago
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆19Updated last year
- Python implementation of CTC beam search decoder + agnostic LM scorer☆19Updated 3 years ago
- Losses and decoders for end-to-end ASR and OCR☆33Updated 4 years ago
- ☆16Updated 2 years ago
- ☆9Updated 4 years ago
- SIGMORPHON 2020 Shared Task: Grapheme-to-Phoneme, Unsupervised Induction of Morphology, and Typologically Diverse Morphological Inflectio…☆35Updated 3 years ago
- This repository contains the Kaldi LF-MMI implementation of the paper "Bayesian Learning of LF-MMI Trained Time Delay Neural Networks for…☆9Updated 2 years ago
- Grapheme to phoneme model for PyTorch☆40Updated 2 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Updated 5 years ago
- ☆20Updated 6 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆14Updated 2 years ago
- End-to-end diarization loss☆22Updated 3 years ago
- Tensor2tensor experiment with SpecAugment☆47Updated 5 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Updated 9 years ago
- A handy dataset of noises for ASR☆19Updated 5 years ago
- ☆12Updated 3 years ago
- This repository contains data used in the NAACL 2021 Paper - Proteno: Text Normalization with Limited Data for Fast Deployment in Text to…☆42Updated 3 years ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- phone inventory library☆15Updated last year
- Recurrent Neural Aligner☆49Updated 4 years ago