This is a working example of using CTC for phone recognition on TIMIT
☆50Oct 19, 2017Updated 8 years ago
Alternatives and similar repositories for CTC-speech-recognition
Users that are interested in CTC-speech-recognition are comparing it to the libraries listed below.
- End-to-End speech recognition implementation based on TensorFlow (CTC, Attention, and MTL training)☆314Jan 23, 2018Updated 8 years ago
- Code for end-to-end ASR with neural networks, built with TensorFlow☆110Jan 24, 2019Updated 7 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆47Jun 24, 2020Updated 5 years ago
- PyTorch-based phoneme recognition (TIMIT phoneme classification)☆35Apr 25, 2018Updated 7 years ago
- All you need to get started for the Zero Speech Challenge 2017☆47Apr 23, 2019Updated 6 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popu…☆19Jan 18, 2018Updated 8 years ago
- CNN learns feature mapping between corrupted and clean speech☆12Aug 14, 2017Updated 8 years ago
- A collection of programming notebooks that I've created.☆16Oct 18, 2020Updated 5 years ago
- Speech recognition on the TIMIT (or any other) dataset☆44Nov 2, 2017Updated 8 years ago
- Time-domain Audio Separation Network☆24Aug 3, 2018Updated 7 years ago
- Audio Visual Speech Recognition☆23Aug 9, 2017Updated 8 years ago
- The official repository of the Eesen project☆204Aug 9, 2016Updated 9 years ago
- Portal of Johannes and Felix's RNN implementation and further modifications for ASR☆21Nov 27, 2014Updated 11 years ago
- Perform the forced decoding with target transcription☆11Sep 12, 2018Updated 7 years ago
- ASR for Chinese Mandarin☆76Jun 1, 2018Updated 7 years ago
- This is now the official location of the Kaldi project.☆27Jun 13, 2016Updated 9 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆15Sep 4, 2019Updated 6 years ago
- Educational tutorials for speech and language processing classes☆12Jan 8, 2019Updated 7 years ago
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.☆65Jan 2, 2020Updated 6 years ago
- End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow☆20Mar 29, 2018Updated 7 years ago
- Easier analysis of large speech corpora☆23Jun 22, 2021Updated 4 years ago
- Phoneme recognition using various features and deep learning.☆13Nov 27, 2019Updated 6 years ago
- The official repository of the Eesen project☆834May 23, 2019Updated 6 years ago
- Python functions for reading kaldi data formats. Useful for rapid prototyping with python.☆378Jun 16, 2023Updated 2 years ago
- Some deep learning models written with mxnet and C++11.☆12Feb 6, 2018Updated 8 years ago
- ☆38May 13, 2020Updated 5 years ago
- An open-source speech separation and enhancement library☆214May 13, 2020Updated 5 years ago
- Some notes on Kaldi☆31Feb 20, 2015Updated 11 years ago
- Collection of machine learning demos for Automatic Speech Recognition☆55Sep 24, 2021Updated 4 years ago
- ☆76Mar 18, 2022Updated 4 years ago
- ☆19Feb 28, 2018Updated 8 years ago
- CTC end-to-end ASR for the TIMIT and 863 corpora.☆219Dec 20, 2019Updated 6 years ago
- This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…☆1,212Dec 19, 2020Updated 5 years ago
- A pure python module for reading and writing kaldi ark files☆268Mar 6, 2025Updated last year
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- ☆11Feb 19, 2021Updated 5 years ago