katsugeneration / tensor-fsmn
Feedforward Sequential Memory Networks (FSMN) implemented in TensorFlow
☆51 · Updated 7 years ago
Related projects
Alternatives and complementary repositories for tensor-fsmn
- TensorFlow version of DFSMN ☆48 · Updated 6 years ago
- Implementation of an end-to-end ASR algorithm with TensorFlow ☆40 · Updated 6 years ago
- Open Source WFST-based Decoder Toolkit ☆76 · Updated 8 years ago
- A working example of using CTC for phone recognition on TIMIT ☆50 · Updated 7 years ago
- An implementation of the RNN transducer for sequence labeling problems ☆22 · Updated 6 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers ☆55 · Updated 6 years ago
- ☆55 · Updated 4 years ago
- LSTM CTC End2End Speech Recognition. ☆38 · Updated 5 years ago
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition. ☆21 · Updated 7 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf). ☆31 · Updated 5 years ago
- ☆41 · Updated 6 years ago
- ChiNese Text Normalization (CNTN) tool for text-to-speech systems ☆35 · Updated 6 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi ☆63 · Updated 5 years ago
- Minimize Kaldi nnet3 chain decoder ☆45 · Updated 4 years ago
- A CUDA-C implementation of FOFE and FSMN ☆20 · Updated 8 years ago
- Custom decoders for Kaldi ☆80 · Updated 5 years ago
- Implementations of FSMN (Feedforward Sequential Memory Network), cFSMN, DFSMN, and PFSMN units ☆9 · Updated 6 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks ☆136 · Updated 3 years ago
- Implementations of CTC, Speech-Transformer, and CIF for end-to-end speech recognition with PyTorch ☆22 · Updated 4 years ago
- ASR for Chinese Mandarin ☆75 · Updated 6 years ago
- A Python package that lets TensorFlow read Kaldi scp/ark files in an elegant way, so that Kaldi users can comfortably enter the TensorFlow world. ☆40 · Updated 5 years ago
- Seq2Seq Speech Recognition with Transformer on Mandarin Chinese ☆115 · Updated 4 years ago
- TristouNet: Triplet Loss for Speaker Turn Embedding ☆125 · Updated 7 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN) ☆58 · Updated 4 years ago
- MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications. ☆78 · Updated 5 years ago
- ☆16 · Updated 5 years ago
- Comparison of three CTC decoders: greedy decoder, beam decoder, and prefix beam decoder ☆20 · Updated 6 years ago
- Automatic Speech Recognition using TensorFlow ☆46 · Updated 7 years ago
- This is now the official location of the Kaldi project. ☆22 · Updated 5 years ago