craffel/mocha

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/craffel/mocha)

craffel / mocha

Example implementation of Monotonic Chunkwise Attention.

☆54

Alternatives and similar repositories for mocha

Users that are interested in mocha are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

j-min / MoChA-pytorch
View on GitHub
PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)
☆81Apr 2, 2018Updated 8 years ago
HaoranMiao / streaming-attention
View on GitHub
streaming attention networks for end-to-end automatic speech recognition
☆56May 6, 2020Updated 6 years ago
rwth-i6 / returnn-experiments
View on GitHub
experiments with RETURNN
☆162Jun 18, 2026Updated last month
craffel / mad
View on GitHub
Code for "Online and Linear Time Attention by Enforcing Monotonic Alignments"
☆94May 2, 2018Updated 8 years ago
athena-team / athena-decoder
View on GitHub
☆76Mar 18, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
cywang97 / StreamingTransformer
View on GitHub
☆277Jan 15, 2021Updated 5 years ago
thu-spmi / CAT
View on GitHub
CAT is more than a CRF-based ASR toolkit: it provides a complete workflow for data-efficient end-to-end ASR, supporting CTC, CTC-CRF, RNN…
☆368Feb 5, 2026Updated 5 months ago
1ytic / warp-rnnt
View on GitHub
CUDA-Warp RNN-Transducer
☆216Feb 22, 2023Updated 3 years ago
YiwenShaoStephen / pychain
View on GitHub
PyTorch implementation of LF-MMI for End-to-end ASR
☆221Jan 14, 2021Updated 5 years ago
1ytic / edit-distance-papers
View on GitHub
A curated list of papers dedicated to edit-distance as objective function
☆53Aug 22, 2020Updated 5 years ago
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
HawkAaron / warp-transducer
View on GitHub
A fast parallel implementation of RNN Transducer.
☆314Jun 7, 2023Updated 3 years ago
k2-fsa / fast_rnnt
View on GitHub
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆149Aug 25, 2023Updated 2 years ago
nttcslab-sp / torchain
View on GitHub
WIP: pytorch FFI wrapper for Kaldi chain loss (a.k.a. Lattice Free MMI)
☆20Feb 20, 2019Updated 7 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
espnet / icassp2020-tts
View on GitHub
ESPnet-TTS Audio Sample HP
☆21Oct 25, 2019Updated 6 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
vrenkens / nabu
View on GitHub
Code for end-to-end ASR with neural networks, build with TensorFlow
☆110Jan 24, 2019Updated 7 years ago
MarkWuNLP / SemanticMask
View on GitHub
The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"
☆39Jun 9, 2020Updated 6 years ago
cornerfarmer / ctc_segmentation
View on GitHub
Segment a given audio into utterances using a trained end-to-end ASR model.
☆75Oct 9, 2020Updated 5 years ago
r9y9 / icassp2020-espnet-tts-merlin-baseline
View on GitHub
ICASSP 2020 ESPnet-TTS: Merlin baseline system
☆37Oct 28, 2019Updated 6 years ago
sequence-labeling / rnn-transducer
View on GitHub
An implementation of rnn transducer for sequence labeling problem
☆22Feb 24, 2018Updated 8 years ago
oxinabox / Kaldi-Notes
View on GitHub
Some notes on Kaldi
☆32Feb 20, 2015Updated 11 years ago
markusdr / transducersaurus
View on GitHub
Automatically exported from code.google.com/p/transducersaurus
☆11Apr 1, 2015Updated 11 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
weixsong / WaveGlow
View on GitHub
Tensorflow Implementation of WaveGlow
☆37May 4, 2020Updated 6 years ago
jpuigcerver / kaldi-decoders
View on GitHub
Custom decoders for Kaldi
☆81Jun 10, 2019Updated 7 years ago
Kyubyong / specAugment
View on GitHub
Tensor2tensor experiment with SpecAugment
☆46May 13, 2019Updated 7 years ago
espnet / interspeech2019-tutorial
View on GitHub
INTERSPEECH 2019 Tutorial Materials
☆194Mar 30, 2021Updated 5 years ago
austinmoehle / wavernn
View on GitHub
WaveRNN-based waveform generator & demo of TensorFlow CuDNN-GRU usage.
☆24Aug 19, 2018Updated 7 years ago
mjansche / tts-tutorial
View on GitHub
Text-to-Speech tutorial at SLTU 2016
☆35May 10, 2016Updated 10 years ago
athena-team / DiDiSpeech
View on GitHub
☆45Oct 24, 2020Updated 5 years ago
burchim / EfficientConformer
View on GitHub
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
☆221Jun 22, 2023Updated 3 years ago
ShigekiKarita / espnet-semi-supervised
View on GitHub
ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…
☆38Feb 13, 2020Updated 6 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
facebookresearch / fbai-speech
View on GitHub
Repo for the FB AI Speech team.
☆27Aug 24, 2021Updated 4 years ago
SaeedNajafi / pytorch-ocd
View on GitHub
Implementation of the Optimal Completion Distillation for Sequence Labeling
☆17Jul 25, 2024Updated last year
placebokkk / pyfst
View on GitHub
A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)
☆17Apr 2, 2018Updated 8 years ago
lmc2179 / ngram-language-model
View on GitHub
An implementation of a HMM Ngram language model.
☆10Mar 12, 2015Updated 11 years ago
ondrejklejch / acoustic_punctuation
View on GitHub
NMT based punctuation prediction system using lexical and acoustic features .
☆14Mar 30, 2020Updated 6 years ago
tencent-ailab / pika
View on GitHub
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
☆354Dec 25, 2020Updated 5 years ago
thuhcsi / FlatTN
View on GitHub
Chinese Text Normalization and Dataset
☆91May 14, 2022Updated 4 years ago