Example implementation of Monotonic Chunkwise Attention.
☆53Feb 23, 2018Updated 8 years ago
Alternatives and similar repositories for mocha
Users who are interested in mocha are comparing it to the libraries listed below.
- PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)☆81Apr 2, 2018Updated 8 years ago
- streaming attention networks for end-to-end automatic speech recognition☆56May 6, 2020Updated 6 years ago
- experiments with RETURNN☆162May 8, 2026Updated 2 weeks ago
- Code for "Online and Linear Time Attention by Enforcing Monotonic Alignments"☆94May 2, 2018Updated 8 years ago
- ☆76Mar 18, 2022Updated 4 years ago
- ☆277Jan 15, 2021Updated 5 years ago
- CAT is more than a CRF-based ASR toolkit: it provides a complete workflow for data-efficient end-to-end ASR, supporting CTC, CTC-CRF, RNN…☆369Feb 5, 2026Updated 3 months ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆221Jan 14, 2021Updated 5 years ago
- A curated list of papers dedicated to edit-distance as objective function☆53Aug 22, 2020Updated 5 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- A fast parallel implementation of RNN Transducer.☆314Jun 7, 2023Updated 2 years ago
- A torch implementation of a recursion which turns out to be useful for RNN-T.☆149Aug 25, 2023Updated 2 years ago
- WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)☆20Feb 20, 2019Updated 7 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- In this repository, I try to combine k2 with speechbrain to decode accurately and quickly.☆16Jun 17, 2022Updated 3 years ago
- Code for end-to-end ASR with neural networks, built with TensorFlow☆110Jan 24, 2019Updated 7 years ago
- The repo contains our code for "Semantic Mask for Transformer based End-to-End Speech Recognition"☆39Jun 9, 2020Updated 5 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.☆74Oct 9, 2020Updated 5 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- An implementation of rnn transducer for sequence labeling problem☆22Feb 24, 2018Updated 8 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 8 years ago
- Some notes on Kaldi☆31Feb 20, 2015Updated 11 years ago
- Repo for the FB AI Speech team.☆25Aug 24, 2021Updated 4 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 11 years ago
- Tensorflow Implementation of WaveGlow☆37May 4, 2020Updated 6 years ago
- Custom decoders for Kaldi☆80Jun 10, 2019Updated 6 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 7 years ago
- INTERSPEECH 2019 Tutorial Materials☆194Mar 30, 2021Updated 5 years ago
- WaveRNN-based waveform generator & demo of TensorFlow CuDNN-GRU usage.☆24Aug 19, 2018Updated 7 years ago
- ☆45Oct 24, 2020Updated 5 years ago
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆220Jun 22, 2023Updated 2 years ago
- Text-to-Speech tutorial at SLTU 2016☆35May 10, 2016Updated 10 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- Implementation of the Optimal Completion Distillation for Sequence Labeling☆17Jul 25, 2024Updated last year
- An implementation of a HMM Ngram language model.☆11Mar 12, 2015Updated 11 years ago
- NMT-based punctuation prediction system using lexical and acoustic features.☆14Mar 30, 2020Updated 6 years ago
- A lightweight speech processing toolkit based on PyTorch and (Py)Kaldi☆354Dec 25, 2020Updated 5 years ago
- Chinese Text Normalization and Dataset☆91May 14, 2022Updated 4 years ago