A PyTorch Implementation of End-to-End Models for Speech-to-Text
☆769Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for speech
Users that are interested in speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 6 years ago
- This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…☆1,214Dec 19, 2020Updated 5 years ago
- A Fast Sequence Transducer Implementation with PyTorch Bindings☆200Sep 20, 2022Updated 3 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- Speech Recognition using DeepSpeech2.☆2,140Dec 13, 2022Updated 3 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- A fast parallel implementation of RNN Transducer.☆314Jun 7, 2023Updated 2 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆139Jun 7, 2021Updated 4 years ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,398Mar 14, 2022Updated 4 years ago
- The official repository of the Eesen project☆835May 23, 2019Updated 6 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆221Jan 14, 2021Updated 5 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆809Apr 6, 2023Updated 3 years ago
- ASR with PyTorch☆140Mar 10, 2019Updated 7 years ago
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition☆239May 12, 2020Updated 5 years ago
- CTC end -to-end ASR for timit and 863 corpus.☆219Dec 20, 2019Updated 6 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆939Sep 4, 2024Updated last year
- PyTorch CTC Decoder bindings☆857Apr 4, 2024Updated 2 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.☆1,863Jun 27, 2022Updated 3 years ago
- End-to-End Speech Processing Toolkit☆9,801Updated this week
- End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)☆314Jan 23, 2018Updated 8 years ago
- A Python wrapper for Kaldi☆1,033Nov 30, 2025Updated 4 months ago
- Towards hot directions in industrial end to end speech recognition☆330Nov 30, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆378Jun 16, 2023Updated 2 years ago
- Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow☆4,011Oct 8, 2021Updated 4 years ago
- End-to-End Attention-Based Large Vocabulary Speech Recognition☆265Nov 22, 2022Updated 3 years ago
- End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow☆2,839Mar 24, 2023Updated 3 years ago
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆292Aug 5, 2021Updated 4 years ago
- ☆277Jan 15, 2021Updated 5 years ago
- A pure python module for reading and writing kaldi ark files☆268Mar 6, 2025Updated last year
- Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…☆3,121Oct 19, 2023Updated 2 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆123Apr 15, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Memory efficient transducer loss computation☆70Jun 10, 2022Updated 3 years ago
- Facebook AI Research's Automatic Speech Recognition Toolkit☆6,445Jan 12, 2026Updated 3 months ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆385Mar 24, 2023Updated 3 years ago
- A Keras CTC implementation of Baidu's DeepSpeech for model experimentation☆242Mar 17, 2018Updated 8 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆378Jul 21, 2022Updated 3 years ago
- Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing…☆835Jan 31, 2026Updated 2 months ago
- Auto Segmentation Criterion (ASG) implemented in pytorch☆51Oct 1, 2021Updated 4 years ago