Example implementation of Monotonic Chunkwise Attention.
☆53Feb 23, 2018Updated 8 years ago
Alternatives and similar repositories for mocha
Users that are interested in mocha are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)☆81Apr 2, 2018Updated 7 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55May 6, 2020Updated 5 years ago
- experiments with RETURNN☆161Feb 7, 2026Updated last month
- Code for "Online and Linear Time Attention by Enforcing Monotonic Alignments"☆94May 2, 2018Updated 7 years ago
- ☆76Mar 18, 2022Updated 4 years ago
- ☆277Jan 15, 2021Updated 5 years ago
- A CRF-based ASR Toolkit☆366Feb 5, 2026Updated last month
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆221Jan 14, 2021Updated 5 years ago
- A curated list of papers dedicated to edit-distance as objective function☆53Aug 22, 2020Updated 5 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- A fast parallel implementation of RNN Transducer.☆314Jun 7, 2023Updated 2 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆149Aug 25, 2023Updated 2 years ago
- WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)☆20Feb 20, 2019Updated 7 years ago
- Blitzing Fast CTC Beam Search Decoder☆186Oct 27, 2025Updated 4 months ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆110Jan 24, 2019Updated 7 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆39Jun 9, 2020Updated 5 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- An implementation of rnn transducer for sequence labeling problem☆22Feb 24, 2018Updated 8 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- Some notes on Kaldi☆31Feb 20, 2015Updated 11 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 10 years ago
- Tensorflow Implementation of WaveGlow☆37May 4, 2020Updated 5 years ago
- Custom decoders for Kaldi☆80Jun 10, 2019Updated 6 years ago
- INTERSPEECH 2019 Tutorial Materials☆194Mar 30, 2021Updated 4 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 6 years ago
- WaveRNN-based waveform generator & demo of TensorFlow CuDNN-GRU usage.☆24Aug 19, 2018Updated 7 years ago
- ☆45Oct 24, 2020Updated 5 years ago
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 9 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- Implementation of the Optimal Completion Distillation for Sequence Labeling☆17Jul 25, 2024Updated last year
- An implementation of a HMM Ngram language model.☆11Mar 12, 2015Updated 11 years ago
- NMT based punctuation prediction system using lexical and acoustic features .☆14Mar 30, 2020Updated 5 years ago
- a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi☆352Dec 25, 2020Updated 5 years ago