Example implementation of Monotonic Chunkwise Attention.
☆53Feb 23, 2018Updated 8 years ago
Alternatives and similar repositories for mocha
Users that are interested in mocha are comparing it to the libraries listed below
Sorting:
- PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)☆81Apr 2, 2018Updated 7 years ago
- streaming attention networks for end-to-end automatic speech recognition☆55May 6, 2020Updated 5 years ago
- experiments with RETURNN☆161Feb 7, 2026Updated 3 weeks ago
- ☆276Jan 15, 2021Updated 5 years ago
- Code for "Online and Linear Time Attention by Enforcing Monotonic Alignments"☆94May 2, 2018Updated 7 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Jan 14, 2021Updated 5 years ago
- A curated list of papers dedicated to edit-distance as objective function☆53Aug 22, 2020Updated 5 years ago
- ☆76Mar 18, 2022Updated 3 years ago
- A CRF-based ASR Toolkit☆364Feb 5, 2026Updated 3 weeks ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆150Aug 25, 2023Updated 2 years ago
- A fast parallel implementation of RNN Transducer.☆314Jun 7, 2023Updated 2 years ago
- Blitzing Fast CTC Beam Search Decoder☆186Oct 27, 2025Updated 4 months ago
- Some notes on Kaldi☆31Feb 20, 2015Updated 11 years ago
- WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)☆20Feb 20, 2019Updated 7 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- Tensorflow Implementation of WaveGlow☆37May 4, 2020Updated 5 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- Custom decoders for Kaldi☆80Jun 10, 2019Updated 6 years ago
- Code for end-to-end ASR with neural networks, build with TensorFlow☆110Jan 24, 2019Updated 7 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 9 years ago
- Chinese Text Normalization and Dataset☆91May 14, 2022Updated 3 years ago
- The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"☆39Jun 9, 2020Updated 5 years ago
- INTERSPEECH 2019 Tutorial Materials☆194Mar 30, 2021Updated 4 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- Various algorithms for voice activity detection☆22Jan 31, 2017Updated 9 years ago
- WaveRNN-based waveform generator & demo of TensorFlow CuDNN-GRU usage.☆24Aug 19, 2018Updated 7 years ago
- Code:Completely Unsupervised Speech Recognition By A Generative Adversarial Network Harmonized With Iteratively Refined Hidden Markov Mod…☆25Dec 17, 2019Updated 6 years ago
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- Decoders from Kaldi using OpenFst☆34Jan 29, 2026Updated last month
- An implementation of rnn transducer for sequence labeling problem☆22Feb 24, 2018Updated 8 years ago
- ☆67Mar 25, 2022Updated 3 years ago
- c++ Kaldi IO lib (static and dynamic).☆25Nov 26, 2018Updated 7 years ago
- maracas is a library for corrupting audio files with additive and convolutive noise.☆72Aug 22, 2017Updated 8 years ago
- High-level API for tar-based dataset☆12Feb 3, 2024Updated 2 years ago