Conv-LSTM-CTC speech recognition network (end-to-end), written in TensorFlow.
☆72Mar 21, 2019Updated 7 years ago
Alternatives and similar repositories for kaggle_speech_recognition
Users that are interested in kaggle_speech_recognition are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆21Nov 25, 2016Updated 9 years ago
- DeepSpeech, Speech To Text, ASR, Speech recognition, Keras, Tensorflow☆30Jan 16, 2018Updated 8 years ago
- ☆12Aug 25, 2017Updated 8 years ago
- End-to-End speech recognition implementation base on TensorFlow (CTC, Attention, and MTL training)☆314Jan 23, 2018Updated 8 years ago
- End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.☆123Apr 15, 2020Updated 5 years ago
- Automatically exported from code.google.com/p/transducersaurus☆11Apr 1, 2015Updated 10 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆32Jun 27, 2019Updated 6 years ago
- Application of Connectionist Temporal Classification (CTC) for Speech Recognition (Tensorflow 1.0 but compatible with 2.0).☆131Mar 4, 2021Updated 5 years ago
- 采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。☆70Jan 26, 2019Updated 7 years ago
- LSTM CTC End2End Speech Recognition.☆38Apr 2, 2019Updated 6 years ago
- A punctuation transcription model to automatically add punctuation marks in an unpunctuated sentence or sentences.☆15Aug 6, 2020Updated 5 years ago
- Hybrid DNN-HMM model for isolated digit recognition☆32Dec 1, 2020Updated 5 years ago
- Tensorflow implementation of "Listen, Attend and Spell" authored by William Chan. This project utilizes input pipeline and estimator API …☆89Jan 31, 2019Updated 7 years ago
- code for 3rd place kaggle tensorflow competition☆98Apr 12, 2018Updated 7 years ago
- A Keras CTC implementation of Baidu's DeepSpeech for model experimentation☆243Mar 17, 2018Updated 8 years ago
- PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).☆39Jul 25, 2019Updated 6 years ago
- Text-to-Speech Synthesis by Generating Spectrograms using Generative Adversarial Network☆10Dec 12, 2018Updated 7 years ago
- FFTNet vocoder implementation☆81Sep 28, 2018Updated 7 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 7 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆40Feb 10, 2018Updated 8 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.☆80Oct 14, 2019Updated 6 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Feb 13, 2020Updated 6 years ago
- A Tensorflow Implementation like "Neural Speech Synthesis with Transformer Network" Port From OpenSeq2Seq☆20Jul 6, 2023Updated 2 years ago
- PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"☆19Jul 19, 2019Updated 6 years ago
- Custom decoders for Kaldi☆80Jun 10, 2019Updated 6 years ago
- Adaptive and Focusing Neural Layers for Multi-Speaker Separation Problem☆51Jul 7, 2018Updated 7 years ago
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- End-to-End Attention-Based Large Vocabulary Speech Recognition☆265Nov 22, 2022Updated 3 years ago
- A simple implementation of the paper https://arxiv.org/pdf/1910.00716v1.pdf☆31Feb 10, 2022Updated 4 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Nov 25, 2019Updated 6 years ago
- Feature extraction for accented-speech or pathological speech☆18Apr 2, 2019Updated 6 years ago
- Open solution to the Cdiscount’s Image Classification Challenge☆19Jun 22, 2022Updated 3 years ago
- Server framework for Kaldi ASR Toolkit☆99Sep 17, 2023Updated 2 years ago
- ☆42Jun 25, 2018Updated 7 years ago
- Share some recent speaker recognition papers and their implementations.☆90Sep 26, 2019Updated 6 years ago
- Tensorflow Optimizers☆11Sep 1, 2019Updated 6 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆139Jun 7, 2021Updated 4 years ago
- PyTorch end-to-end speech recognition☆49Dec 30, 2020Updated 5 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago