End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
☆123Apr 15, 2020Updated 5 years ago
Alternatives and similar repositories for ctc-asr
Users that are interested in ctc-asr are comparing it to the libraries listed below.
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 6 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- Builds an acoustic model with an end-to-end approach, using characters as the modeling unit and a DCNN-CTC network structure.☆70Jan 26, 2019Updated 7 years ago
- Download and preparation tool for free speech corpora.☆16Apr 28, 2019Updated 6 years ago
- End-to-end speech recognition using RNN Transducers in TensorFlow 2.0☆249Jul 15, 2025Updated 8 months ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆139Jun 7, 2021Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode accurately and quickly.☆16Jun 17, 2022Updated 3 years ago
- ESPnet-TTS Audio Sample HP☆21Oct 25, 2019Updated 6 years ago
- Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure Python and PyTorch☆26Jul 25, 2024Updated last year
- A PyTorch Implementation of Transducer Model for End-to-End Speech Recognition☆239May 12, 2020Updated 5 years ago
- TensorFlow implementation of "Listen, Attend and Spell" authored by William Chan. This project utilizes input pipeline and estimator API …☆89Jan 31, 2019Updated 7 years ago
- This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…☆1,212Dec 19, 2020Updated 5 years ago
- End-to-End speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)☆314Jan 23, 2018Updated 8 years ago
- ASR with PyTorch☆140Mar 10, 2019Updated 7 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆103Mar 18, 2019Updated 7 years ago
- ☆277Jan 15, 2021Updated 5 years ago
- Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.☆72Mar 21, 2019Updated 7 years ago
- Research on acoustic models for speech recognition based on convolutional neural networks☆181Jul 22, 2019Updated 6 years ago
- A code library for training acoustic models☆27May 20, 2020Updated 5 years ago
- Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation☆39Jul 16, 2020Updated 5 years ago
- A PyTorch Implementation of End-to-End Models for Speech-to-Text☆769Jul 6, 2023Updated 2 years ago
- ICASSP 2020 ESPnet-TTS: Merlin baseline system☆36Oct 28, 2019Updated 6 years ago
- This is a single-speaker neural text-to-speech (TTS) system capable of training in an end-to-end fashion. It is inspired by the Tacotron a…☆13Dec 28, 2018Updated 7 years ago
- CTC end-to-end ASR for the TIMIT and 863 corpora.☆219Dec 20, 2019Updated 6 years ago
- A fast parallel implementation of RNN Transducer.☆314Jun 7, 2023Updated 2 years ago
- mixlingual speech recognition system; hybrid (GMM+NNet) model; Kaldi + Keras☆71Nov 20, 2017Updated 8 years ago
- PyTorch end-to-end speech recognition☆49Dec 30, 2020Updated 5 years ago
- DaCiDian is an open-source Mandarin Chinese lexicon for automatic speech recognition (ASR)☆301Jun 15, 2020Updated 5 years ago
- 🎧 Automatic Speech Recognition: DeepSpeech & Seq2Seq (TensorFlow)☆223Jun 15, 2020Updated 5 years ago
- Implements end-to-end ASR algorithms with TensorFlow☆40Aug 23, 2018Updated 7 years ago
- ☆34Jul 16, 2019Updated 6 years ago
- Implementations of CTC, Speech-Transformer, and CIF for end-to-end speech recognition with PyTorch☆23Jul 28, 2020Updated 5 years ago
- A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.☆207Jan 8, 2019Updated 7 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Aug 6, 2020Updated 5 years ago
- CTC Decoder implementation with python only. Also supports language model decoding using KenLM.☆37May 3, 2024Updated last year
- This repo contains our code for "Semantic Mask for Transformer based End-to-End Speech Recognition"☆39Jun 9, 2020Updated 5 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- Code for Temporal Convolution for Real-time Keyword Spotting on Mobile Devices☆232Mar 24, 2023Updated 3 years ago
- The RWTH extensible training framework for universal recurrent neural networks☆373Mar 17, 2026Updated last week