eastonYi/end-to-end_asr_pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/eastonYi/end-to-end_asr_pytorch)

eastonYi / end-to-end_asr_pytorch

Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch

☆23

Alternatives and similar repositories for end-to-end_asr_pytorch

Users that are interested in end-to-end_asr_pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zzpDapeng / Transformer-Transducer
View on GitHub
A streamable speech recognition model with transformer encoders and RNN-T loss
☆11Mar 1, 2021Updated 5 years ago
lightning830 / E2E-audio-speech-recognition
View on GitHub
Conformer encoder + Transformer decoder with Hybrid CTC/attention
☆12Nov 11, 2021Updated 4 years ago
ZhengkunTian / rnn-transducer
View on GitHub
A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition
☆239May 12, 2020Updated 6 years ago
tongjinle123 / speech-transformer-pytorch_lightning
View on GitHub
ASR project with pytorch-lightning
☆20Mar 21, 2025Updated last year
LeeYongHyeok / DCM_vgg_transformer
View on GitHub
Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…
☆14Jul 2, 2020Updated 6 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
igormq / ctcdecode-pytorch
View on GitHub
Python implementation of CTC beam search decoder + agnostic LM scorer
☆20Dec 16, 2020Updated 5 years ago
dalinvip / PyTorch_Chinese_word_segmentation
View on GitHub
Chinese word segmentation with the neural seq2seq model implement in pytorch
☆10Dec 13, 2017Updated 8 years ago
chenjiasheng / mwer
View on GitHub
mWER loss implementation in tensorflow
☆31Sep 7, 2020Updated 5 years ago
TeaPoly / warp-ctc-crf
View on GitHub
An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.
☆12Jul 5, 2021Updated 5 years ago
wenet-e2e / speech-recognition-papers
View on GitHub
Towards hot directions in industrial end to end speech recognition
☆329Nov 30, 2021Updated 4 years ago
ZhengkunTian / Speech-Tranformer-Pytorch
View on GitHub
Seq2Seq Speech Recognition with Transformer on Mandarin Chinese
☆117Dec 20, 2019Updated 6 years ago
TeaPoly / CTC-OptimizedLoss
View on GitHub
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆59Sep 6, 2023Updated 2 years ago
clayandgithub / rnn_cws
View on GitHub
chinese word segmentation based on rnn
☆13Oct 14, 2016Updated 9 years ago
mobvoi / lstm_ctc
View on GitHub
LSTM CTC End2End Speech Recognition.
☆38Apr 2, 2019Updated 7 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
dqqcasia / mosst
View on GitHub
☆27Aug 31, 2022Updated 3 years ago
HaoranMiao / streaming-attention
View on GitHub
streaming attention networks for end-to-end automatic speech recognition
☆56May 6, 2020Updated 6 years ago
vectominist / End-to-end-ASR-Pytorch-DLHLP
View on GitHub
Joint CTC-Attention End-to-end Speech Recognition - PyTorch Implementation (Deep Learning for Human Language Processing Special Project)
☆17Nov 22, 2020Updated 5 years ago
Narasimha1997 / wavenet-stt
View on GitHub
An end-to-end speech recognition system with Wavenet. Built using C++ and python.
☆21Sep 21, 2020Updated 5 years ago
DemisEom / RNNT-pytorch
View on GitHub
Implementaion RNN tranceducer
☆23Jun 25, 2019Updated 7 years ago
pigzach / MagicSpeechASR
View on GitHub
magicspeech competition recipe
☆18Jun 29, 2020Updated 6 years ago
lzuwei / end-to-end-multiview-lipreading
View on GitHub
End to End Multiview Lip Reading
☆10Jan 26, 2018Updated 8 years ago
MingLunHan / CIF-PyTorch
View on GitHub
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…
☆78Jul 14, 2026Updated 2 weeks ago
MarkWuNLP / SemanticMask
View on GitHub
The repo contains our code of ``Semantic Mask for Transformer based End-to-End Speech Recognition"
☆39Jun 9, 2020Updated 6 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
luckynote / openpose
View on GitHub
OpenPose: A Real-Time Multi-Person Keypoint Detection And Multi-Threading C++ Library
☆11Jul 13, 2017Updated 9 years ago
diaoenmao / Speech-Emotion-Recognition-with-Dual-Sequence-LSTM-Architecture
View on GitHub
[ICASSP 2020] Speech Emotion Recognition with Dual-Sequence LSTM Architecture
☆12Jan 17, 2025Updated last year
oshindow / Transformer-Transducer
View on GitHub
A pytorch_lightning reimplementation of the Transducer module from ESPnet.
☆78Mar 11, 2021Updated 5 years ago
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
George0828Zhang / torch_cif
View on GitHub
A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…
☆37Feb 10, 2024Updated 2 years ago
jctian98 / e2e_lfmmi
View on GitHub
E2E system with LF-MMI; word N-gram for Mandarin
☆167Apr 29, 2022Updated 4 years ago
upskyy / Transformer-Transducer
View on GitHub
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆114Feb 27, 2022Updated 4 years ago
sooftware / deepspeech2
View on GitHub
PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)
☆29Mar 5, 2021Updated 5 years ago
gentaiscool / end2end-asr-pytorch
View on GitHub
End-to-End Automatic Speech Recognition on PyTorch
☆304Jun 2, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
YosukeHiguchi / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆16Jan 20, 2025Updated last year
xing96 / MIM-lipreading
View on GitHub
Code and model for paper <Mutual Information Maximization for Effective Lip Reading>
☆19Sep 4, 2020Updated 5 years ago
xiaoquanjie / netlib2
View on GitHub
跨平台网络库，使用epoll和iocp模型
☆11May 13, 2018Updated 8 years ago
ZhengkunTian / OpenTransformer
View on GitHub
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
☆378Jul 21, 2022Updated 4 years ago
kaituoxu / Speech-Transformer
View on GitHub
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆810Apr 6, 2023Updated 3 years ago
DaoZhang0123 / compareCTCDecoder
View on GitHub
compare three CTC decoder, that is greedy decoder, beam decoder and prefix beam decoder
☆20Jul 10, 2018Updated 8 years ago
cywang97 / StreamingTransformer
View on GitHub
☆277Jan 15, 2021Updated 5 years ago