desh2608 / dnn-hmm-asr
Hybrid DNN-HMM model for isolated digit recognition
☆32Updated 4 years ago
Alternatives and similar repositories for dnn-hmm-asr:
Users that are interested in dnn-hmm-asr are comparing it to the libraries listed below
- Python implementation of simple GMM and HMM models for isolated digit recognition.☆63Updated 4 years ago
- ☆60Updated 4 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆61Updated 4 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Updated 6 years ago
- an Audio-Visual Voice Activity Detection using Deep Learning☆48Updated 6 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆65Updated 6 years ago
- PyTorch implementation of Densely Connected Time Delay Neural Network☆88Updated last year
- implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"☆36Updated 5 years ago
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- 💬 A list of End-to-End speech recognition, including papers, codes and other materials☆51Updated 6 years ago
- ☆99Updated 7 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 6 years ago
- ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…☆38Updated 5 years ago
- PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and …☆145Updated 5 years ago
- Implementaion RNN tranceducer☆22Updated 5 years ago
- ☆15Updated 5 years ago
- Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP…☆59Updated 4 years ago
- Seq2Seq Speech Recognition with Transformer on Mandarin Chinese☆116Updated 5 years ago
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"☆32Updated 6 years ago
- ☆55Updated 4 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Updated 5 years ago
- VoxSRC Challenge☆31Updated 5 years ago
- Transformer-based online speech recognition system with TensorFlow 2☆26Updated 4 years ago
- Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch☆43Updated 4 years ago
- Discriminative Neural Clustering for Speaker Diarisation☆78Updated 3 years ago
- ☆35Updated 6 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆73Updated 5 years ago
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆44Updated last year
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆57Updated last year