zweiein / End_to_end_Speech_Papers
☆13Updated 7 years ago
Related projects: ⓘ
- compare three CTC decoder, that is greedy decoder, beam decoder and prefix beam decoder☆19Updated 6 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆31Updated 5 years ago
- Old language modeling tool that's used in kaldi☆16Updated last year
- Region proposal network based small-footprint keyword spotting (Pytorch)☆51Updated 10 months ago
- Coordinate-wise meta-learner for speaker adaptation of ASR models.☆20Updated 4 years ago
- SE-Resnet+AMSoftmax for Speaker Verification☆47Updated 5 years ago
- ☆9Updated 6 years ago
- solutions for https://www.kaggle.com/c/tensorflow-speech-recognition-challenge☆32Updated 6 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Updated 6 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆62Updated 5 years ago
- LSTM CTC End2End Speech Recognition.☆38Updated 5 years ago
- CTC+Beam_Search+kenlm 是用于以汉字为声学模型建模单元的解码系统☆44Updated 6 years ago
- ☆98Updated 6 years ago
- magicspeech competition recipe☆18Updated 4 years ago
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.☆21Updated 7 years ago
- Feedforward Sequential Memory Networks (FSMN) implemented by tensorflow☆52Updated 7 years ago
- Speaker embedding(verification and recognition) using Tensorflow with Kaldi☆41Updated 7 years ago
- Example implementation of Monotonic Chunkwise Attention.☆49Updated 6 years ago
- An implementation of rnn transducer for sequence labeling problem☆22Updated 6 years ago
- ☆16Updated this week
- Implementations for FSMN (Feedforward Sequential Memory Network), cFSMN, DFSMN, and PFSMN units☆9Updated 5 years ago
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆21Updated 4 years ago
- ☆55Updated 4 years ago
- ☆86Updated this week
- ☆27Updated 6 years ago
- "Automated Speech Recognition System" in Machine Learning and Having it Deep and Structured, Spring 2015☆20Updated 7 years ago
- Code and instruction on replicating the experiments done in paper: Unified Hypersphere Embedding for Speaker Recognition☆31Updated 5 years ago
- VoxSRC Challenge☆31Updated 5 years ago
- This is a working example of using CTC for phone recognition on TIMIT☆50Updated 6 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Updated 5 years ago