zweiein / End_to_end_Speech_Papers
☆13Updated 7 years ago
Related projects
Alternatives and complementary repositories for End_to_end_Speech_Papers
- Compares three CTC decoders: the greedy decoder, the beam decoder, and the prefix beam decoder☆20Updated 6 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆31Updated 5 years ago
- magicspeech competition recipe☆18Updated 4 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi☆63Updated 5 years ago
- LSTM CTC End2End Speech Recognition.☆38Updated 5 years ago
- Implements an end-to-end ASR algorithm with TensorFlow☆40Updated 6 years ago
- ☆55Updated 4 years ago
- Coordinate-wise meta-learner for speaker adaptation of ASR models.☆20Updated 4 years ago
- Region proposal network based small-footprint keyword spotting (PyTorch)☆52Updated last year
- solutions for https://www.kaggle.com/c/tensorflow-speech-recognition-challenge☆32Updated 6 years ago
- ☆41Updated 6 years ago
- Implementation of a speaker identification and a speaker verification system based on Gaussian Mixture Models (GMM) in combination with a…☆21Updated 6 years ago
- Speaker embedding (verification and recognition) using TensorFlow with Kaldi☆41Updated 7 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Updated 6 years ago
- Old language modeling tool that's used in Kaldi☆16Updated last year
- 💬 A list of End-to-End speech recognition, including papers, codes and other materials☆53Updated 5 years ago
- PyTorch bindings for Warp-CTC☆42Updated 4 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆58Updated 4 years ago
- Kaldi extended by Kaituo XU with new features in nnet1.☆12Updated 5 years ago
- A KALDI/C++ implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆14Updated 5 years ago
- PyTorch bindings for warp-ctc maintained by ESPnet☆19Updated 3 years ago
- Conversion of recurrent neural network language models to weighted finite state transducers☆55Updated 6 years ago
- VoxSRC Challenge☆31Updated 5 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Updated 5 years ago
- Implementations of CTC, Speech-Transformer, and CIF for end-to-end speech recognition with PyTorch☆22Updated 4 years ago
- An implementation of the RNN transducer for sequence labeling problems☆22Updated 6 years ago
- TensorFlow version of DFSMN☆48Updated 6 years ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Updated 6 years ago
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.☆21Updated 7 years ago