HawkAaron/E2E-ASR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HawkAaron/E2E-ASR)

HawkAaron / E2E-ASR

PyTorch Implementations for End-to-End Automatic Speech Recognition

☆127

Alternatives and similar repositories for E2E-ASR

Users that are interested in E2E-ASR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HawkAaron / warp-transducer
View on GitHub
A fast parallel implementation of RNN Transducer.
☆314Jun 7, 2023Updated 3 years ago
HawkAaron / RNN-Transducer
View on GitHub
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
☆140Jun 7, 2021Updated 5 years ago
awni / transducer
View on GitHub
A Fast Sequence Transducer Implementation with PyTorch Bindings
☆200Sep 20, 2022Updated 3 years ago
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
ZhengkunTian / rnn-transducer
View on GitHub
A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition
☆239May 12, 2020Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
1ytic / warp-rnnt
View on GitHub
CUDA-Warp RNN-Transducer
☆216Feb 22, 2023Updated 3 years ago
awni / speech
View on GitHub
A PyTorch Implementation of End-to-End Models for Speech-to-Text
☆768Jul 6, 2023Updated 3 years ago
mdangschat / ctc-asr
View on GitHub
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
☆123Apr 15, 2020Updated 6 years ago
gentaiscool / end2end-asr-pytorch
View on GitHub
End-to-End Automatic Speech Recognition on PyTorch
☆304Jun 2, 2022Updated 4 years ago
MarkWuNLP / SemanticMask
View on GitHub
The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"
☆39Jun 9, 2020Updated 6 years ago
shanguanma / Aligners
View on GitHub
HMM, CTC, RNN-Transducer, forward-backward algorithm
☆20Sep 5, 2023Updated 2 years ago
cywang97 / StreamingTransformer
View on GitHub
☆277Jan 15, 2021Updated 5 years ago
datemoon / tf-code-acoustics
View on GitHub
it's a train acoustics model code lib
☆27May 20, 2020Updated 6 years ago
DemisEom / RNNT-pytorch
View on GitHub
Implementaion RNN tranceducer
☆23Jun 25, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
wenet-e2e / speech-recognition-papers
View on GitHub
Towards hot directions in industrial end to end speech recognition
☆329Nov 30, 2021Updated 4 years ago
noahchalifour / rnnt-speech-recognition
View on GitHub
End-to-end speech recognition using RNN Transducers in Tensorflow 2.0
☆250Jul 15, 2025Updated last year
YiwenShaoStephen / pychain
View on GitHub
PyTorch implementation of LF-MMI for End-to-end ASR
☆221Jan 14, 2021Updated 5 years ago
kaituoxu / Speech-Transformer
View on GitHub
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆810Apr 6, 2023Updated 3 years ago
chenjiasheng / mwer
View on GitHub
mWER loss implementation in tensorflow
☆31Sep 7, 2020Updated 5 years ago
placebokkk / pyfst
View on GitHub
A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)
☆17Apr 2, 2018Updated 8 years ago
TeaPoly / CTC-OptimizedLoss
View on GitHub
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆59Sep 6, 2023Updated 2 years ago
Alexander-H-Liu / End-to-end-ASR-Pytorch
View on GitHub
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…
☆1,210Dec 19, 2020Updated 5 years ago
zyascend / End-to-End-Speech-Recognition-Learning
View on GitHub
ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别
☆12Oct 25, 2020Updated 5 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Diamondfan / CTC_pytorch
View on GitHub
CTC end -to-end ASR for timit and 863 corpus.
☆219Dec 20, 2019Updated 6 years ago
thu-spmi / CAT
View on GitHub
CAT is more than a CRF-based ASR toolkit: it provides a complete workflow for data-efficient end-to-end ASR, supporting CTC, CTC-CRF, RNN…
☆368Feb 5, 2026Updated 5 months ago
jinserk / pytorch-asr
View on GitHub
ASR with PyTorch
☆139Mar 10, 2019Updated 7 years ago
oshindow / Transformer-Transducer
View on GitHub
A pytorch_lightning reimplementation of the Transducer module from ESPnet.
☆78Mar 11, 2021Updated 5 years ago
athena-team / athena-decoder
View on GitHub
☆76Mar 18, 2022Updated 4 years ago
markusdr / transducersaurus
View on GitHub
Automatically exported from code.google.com/p/transducersaurus
☆11Apr 1, 2015Updated 11 years ago
HawkAaron / mxnet-transducer
View on GitHub
Fast parallel RNN-Transducer.
☆10Nov 1, 2019Updated 6 years ago
githubharald / CTCDecoder
View on GitHub
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing…
☆837Jan 31, 2026Updated 5 months ago
k2-fsa / fast_rnnt
View on GitHub
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆149Aug 25, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
zcaceres / spec_augment
View on GitHub
🔦 A Pytorch implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆501Jun 11, 2021Updated 5 years ago
theblackcat102 / edgedict
View on GitHub
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
☆292Aug 5, 2021Updated 4 years ago
jctian98 / e2e_lfmmi
View on GitHub
E2E system with LF-MMI; word N-gram for Mandarin
☆167Apr 29, 2022Updated 4 years ago
jzlianglu / pykaldi2
View on GitHub
Yet another speech toolkit based on Kaldi and PyTorch
☆173Jul 1, 2020Updated 6 years ago
rwth-i6 / returnn-experiments
View on GitHub
experiments with RETURNN
☆162Jun 18, 2026Updated last month
irebai / SpecAugment_KALDI
View on GitHub
A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition
☆15Sep 4, 2019Updated 6 years ago
mobvoi / lstm_ctc
View on GitHub
LSTM CTC End2End Speech Recognition.
☆38Apr 2, 2019Updated 7 years ago