A PyTorch Implementation of End-to-End Models for Speech-to-Text
☆769Jul 6, 2023Updated 2 years ago
Alternatives and similar repositories for speech
Users that are interested in speech are comparing it to the libraries listed below
Sorting:
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 6 years ago
- This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…☆1,212Dec 19, 2020Updated 5 years ago
- A Fast Sequence Transducer Implementation with PyTorch Bindings☆199Sep 20, 2022Updated 3 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- Speech Recognition using DeepSpeech2.☆2,139Dec 13, 2022Updated 3 years ago
- A fast parallel implementation of RNN Transducer.☆314Jun 7, 2023Updated 2 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆139Jun 7, 2021Updated 4 years ago
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…☆2,396Mar 14, 2022Updated 3 years ago
- The official repository of the Eesen project☆833May 23, 2019Updated 6 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆220Jan 14, 2021Updated 5 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- ASR with PyTorch☆140Mar 10, 2019Updated 6 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆808Apr 6, 2023Updated 2 years ago
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 3 years ago
- CTC end -to-end ASR for timit and 863 corpus.☆219Dec 20, 2019Updated 6 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆940Sep 4, 2024Updated last year
- PyTorch CTC Decoder bindings☆855Apr 4, 2024Updated last year
- A Pytorch Implementation of Transducer Model for End-to-End Speech Recognition☆239May 12, 2020Updated 5 years ago
- End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)☆314Jan 23, 2018Updated 8 years ago
- Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.☆1,867Jun 27, 2022Updated 3 years ago
- ☆276Jan 15, 2021Updated 5 years ago
- A Python wrapper for Kaldi☆1,030Nov 30, 2025Updated 3 months ago
- Towards hot directions in industrial end to end speech recognition☆332Nov 30, 2021Updated 4 years ago
- End-to-End Speech Processing Toolkit☆9,747Updated this week
- Working online speech recognition based on RNN Transducer. ( Trained model release available in release )☆292Aug 5, 2021Updated 4 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆378Jun 16, 2023Updated 2 years ago
- End-to-End Attention-Based Large Vocabulary Speech Recognition☆265Nov 22, 2022Updated 3 years ago
- Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow☆4,009Oct 8, 2021Updated 4 years ago
- An End-to-End Architecture for Keyword Spotting and Voice Activity Detection☆381Mar 24, 2023Updated 2 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆379Jul 21, 2022Updated 3 years ago
- Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…☆3,114Oct 19, 2023Updated 2 years ago
- Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing…☆835Jan 31, 2026Updated last month
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆123Apr 15, 2020Updated 5 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- Tools for handling multimodal data in machine learning projects.☆1,114Updated this week
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,386Jun 6, 2024Updated last year
- Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP☆1,559May 11, 2021Updated 4 years ago
- A pure python module for reading and writing kaldi ark files☆267Mar 6, 2025Updated 11 months ago
- Facebook AI Research's Automatic Speech Recognition Toolkit☆6,446Jan 12, 2026Updated last month