mobvoi / lstm_ctc
LSTM CTC End2End Speech Recognition.
☆38Updated 5 years ago
Alternatives and similar repositories for lstm_ctc:
Users that are interested in lstm_ctc are comparing it to the libraries listed below
- ☆55Updated 4 years ago
- Minimize kaldi nnet3 chain decoder☆45Updated 5 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆65Updated 6 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆59Updated 4 years ago
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆22Updated 4 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆137Updated 3 years ago
- ☆41Updated 6 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Updated 6 years ago
- Tensorflow version of DFSMN☆49Updated 6 years ago
- Region proposal network based small-footprint keyword spotting (Pytorch)☆54Updated last year
- An LDA/PLDA estimator using KALDI in python for speaker verification tasks☆99Updated 7 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆126Updated 5 years ago
- Recurrent Neural Aligner☆49Updated 4 years ago
- mWER loss implementation in tensorflow☆31Updated 4 years ago
- A pytorch based end2end speech recognition system.☆112Updated 4 years ago
- CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统☆46Updated 6 years ago
- Memory efficient transducer loss computation☆68Updated 2 years ago
- This is a working example of using CTC for phone recognition on TIMIT☆51Updated 7 years ago
- Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"☆32Updated 6 years ago
- Implementaion RNN tranceducer☆21Updated 5 years ago
- it's a train acoustics model code lib☆26Updated 4 years ago
- Implementations for FSMN (Feedforward Sequential Memory Network), cFSMN, DFSMN, and PFSMN units☆9Updated 6 years ago
- ☆9Updated 6 years ago
- ☆61Updated 2 years ago
- 以音素建模构建NN-CTC声学模型☆15Updated 5 years ago
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.☆21Updated 7 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆57Updated last year
- Feedforward Sequential Memory Networks (FSMN) implemented by tensorflow☆50Updated 8 years ago
- 💬 A list of End-to-End speech recognition, including papers, codes and other materials☆52Updated 5 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆31Updated 5 years ago