awni/speech

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/awni/speech)

awni / speech

A PyTorch Implementation of End-to-End Models for Speech-to-Text

☆768

Alternatives and similar repositories for speech

Users that are interested in speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

HawkAaron / E2E-ASR
View on GitHub
PyTorch Implementations for End-to-End Automatic Speech Recognition
☆127Jun 10, 2019Updated 7 years ago
Alexander-H-Liu / End-to-end-ASR-Pytorch
View on GitHub
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…
☆1,210Dec 19, 2020Updated 5 years ago
awni / transducer
View on GitHub
A Fast Sequence Transducer Implementation with PyTorch Bindings
☆200Sep 20, 2022Updated 3 years ago
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
SeanNaren / deepspeech.pytorch
View on GitHub
Speech Recognition using DeepSpeech2.
☆2,136Dec 13, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HawkAaron / RNN-Transducer
View on GitHub
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
☆140Jun 7, 2021Updated 5 years ago
HawkAaron / warp-transducer
View on GitHub
A fast parallel implementation of RNN Transducer.
☆314Jun 7, 2023Updated 3 years ago
mravanelli / pytorch-kaldi
View on GitHub
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…
☆2,399Mar 14, 2022Updated 4 years ago
srvk / eesen
View on GitHub
The official repository of the Eesen project
☆834May 23, 2019Updated 7 years ago
1ytic / warp-rnnt
View on GitHub
CUDA-Warp RNN-Transducer
☆216Feb 22, 2023Updated 3 years ago
YiwenShaoStephen / pychain
View on GitHub
PyTorch implementation of LF-MMI for End-to-end ASR
☆221Jan 14, 2021Updated 5 years ago
kaituoxu / Speech-Transformer
View on GitHub
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆810Apr 6, 2023Updated 3 years ago
jinserk / pytorch-asr
View on GitHub
ASR with PyTorch
☆139Mar 10, 2019Updated 7 years ago
gentaiscool / end2end-asr-pytorch
View on GitHub
End-to-End Automatic Speech Recognition on PyTorch
☆304Jun 2, 2022Updated 4 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
Diamondfan / CTC_pytorch
View on GitHub
CTC end -to-end ASR for timit and 863 corpus.
☆219Dec 20, 2019Updated 6 years ago
freewym / espresso
View on GitHub
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
☆939Sep 4, 2024Updated last year
parlance / ctcdecode
View on GitHub
PyTorch CTC Decoder bindings
☆860Apr 4, 2024Updated 2 years ago
ZhengkunTian / rnn-transducer
View on GitHub
A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition
☆239May 12, 2020Updated 6 years ago
syhw / wer_are_we
View on GitHub
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
☆1,864Jun 27, 2022Updated 4 years ago
espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,898Updated this week
hirofumi0810 / tensorflow_end2end_speech_recognition
View on GitHub
End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)
☆314Jan 23, 2018Updated 8 years ago
pykaldi / pykaldi
View on GitHub
A Python wrapper for Kaldi
☆1,038Nov 30, 2025Updated 7 months ago
wenet-e2e / speech-recognition-papers
View on GitHub
Towards hot directions in industrial end to end speech recognition
☆329Nov 30, 2021Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
buriburisuri / speech-to-text-wavenet
View on GitHub
Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow
☆4,005Oct 8, 2021Updated 4 years ago
KarelVesely84 / kaldi-io-for-python
View on GitHub
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
☆378Jun 16, 2023Updated 3 years ago
theblackcat102 / edgedict
View on GitHub
Working online speech recognition based on RNN Transducer. ( Trained model release available in release )
☆292Aug 5, 2021Updated 4 years ago
zzw922cn / Automatic_Speech_Recognition
View on GitHub
End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow
☆2,834Mar 24, 2023Updated 3 years ago
rizar / attention-lvcsr
View on GitHub
End-to-End Attention-Based Large Vocabulary Speech Recognition
☆265Nov 22, 2022Updated 3 years ago
nttcslab-sp / kaldiio
View on GitHub
A pure python module for reading and writing kaldi ark files
☆268Mar 6, 2025Updated last year
cywang97 / StreamingTransformer
View on GitHub
☆277Jan 15, 2021Updated 5 years ago
zzw922cn / awesome-speech-recognition-speech-synthesis-papers
View on GitHub
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…
☆3,124Oct 19, 2023Updated 2 years ago
mdangschat / ctc-asr
View on GitHub
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
☆123Apr 15, 2020Updated 6 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
csukuangfj / optimized_transducer
View on GitHub
Memory efficient transducer loss computation
☆70Jun 10, 2022Updated 4 years ago
flashlight / wav2letter
View on GitHub
Facebook AI Research's Automatic Speech Recognition Toolkit
☆6,440Jul 14, 2026Updated last week
zh217 / torch-asg
View on GitHub
Auto Segmentation Criterion (ASG) implemented in pytorch
☆51Oct 1, 2021Updated 4 years ago
robmsmt / KerasDeepSpeech
View on GitHub
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
☆242Mar 17, 2018Updated 8 years ago
mindorii / kws
View on GitHub
An End-to-End Architecture for Keyword Spotting and Voice Activity Detection
☆387Mar 24, 2023Updated 3 years ago
ZhengkunTian / OpenTransformer
View on GitHub
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
☆378Jul 21, 2022Updated 4 years ago
githubharald / CTCDecoder
View on GitHub
Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing…
☆837Jan 31, 2026Updated 5 months ago