mravanelli / pytorch_MLP_for_ASR
This code implements a basic MLP for speech recognition. The MLP is trained with PyTorch, while feature extraction, alignment, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example of phoneme recognition on the standard TIMIT dataset is provided.
☆38 · Updated 7 years ago
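The description above mentions an MLP with dropout and batch normalization trained in PyTorch. As a rough illustration only, and not the repository's actual code, such a frame-level classifier could be sketched as follows. The class name `FrameMLP`, the layer sizes, the dropout rate, and the random tensors standing in for Kaldi features and alignments are all assumptions made for this example.

```python
import torch
import torch.nn as nn


class FrameMLP(nn.Module):
    """Hypothetical frame-level MLP: acoustic features in, per-frame class scores out."""

    def __init__(self, feat_dim, hidden_dim, num_classes, num_layers=2, dropout=0.2):
        super().__init__()
        layers = []
        in_dim = feat_dim
        for _ in range(num_layers):
            # Each hidden block: affine -> batch norm -> ReLU -> dropout
            layers += [
                nn.Linear(in_dim, hidden_dim),
                nn.BatchNorm1d(hidden_dim),
                nn.ReLU(),
                nn.Dropout(p=dropout),
            ]
            in_dim = hidden_dim
        # Output layer produces unnormalized scores (logits) per class
        layers.append(nn.Linear(in_dim, num_classes))
        self.net = nn.Sequential(*layers)

    def forward(self, feats):
        # feats: (batch, feat_dim) tensor of per-frame features
        return self.net(feats)


# Assumed sizes: 40-dim features, 256 hidden units, 48 target classes
model = FrameMLP(feat_dim=40, hidden_dim=256, num_classes=48)

# Random stand-ins for a batch of features and their aligned frame labels
feats = torch.randn(8, 40)
targets = torch.randint(0, 48, (8,))

# One forward/backward pass with cross-entropy loss
loss = nn.CrossEntropyLoss()(model(feats), targets)
loss.backward()
```

In a Kaldi-based pipeline, the features and frame labels would come from Kaldi's feature extraction and forced alignments rather than random tensors, and the network's outputs would be passed back to Kaldi for decoding.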
Alternatives and similar repositories for pytorch_MLP_for_ASR
Users who are interested in pytorch_MLP_for_ASR compare it to the repositories listed below.
- Python implementation of pre-processing for End-to-End speech recognition ☆69 · Updated 7 years ago
- Voxceleb1 i-vector based speaker recognition system ☆43 · Updated 7 years ago
- Speaker embedding (verification and recognition) using TensorFlow with Kaldi ☆41 · Updated 7 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN) ☆59 · Updated 5 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a… ☆95 · Updated 5 years ago
- This is a project on working/resolving the speech separation problem using supervised learning on various training targets, building mach… ☆34 · Updated 8 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition. ☆36 · Updated 5 years ago
- ☆60 · Updated 4 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai… ☆54 · Updated 5 years ago
- 💬 A list of End-to-End speech recognition, including papers, codes and other materials ☆52 · Updated 6 years ago
- Develop speaker recognition model based on i-vector using TIMIT database ☆17 · Updated 6 years ago
- ☆35 · Updated 6 years ago
- A set of speech feature extraction functions for ASR and speaker identification written in MATLAB. ☆43 · Updated 8 years ago
- RASTA-PLP and MFCC tool based rasta-mat ☆33 · Updated 3 years ago
- ☆99 · Updated 7 years ago
- TensorFlow implementation of "Speaker-independent Speech Separation with Deep Attractor Network" ☆90 · Updated 4 years ago
- ☆131 · Updated 7 years ago
- ☆30 · Updated 6 years ago
- VoxCeleb plugin for pyannote.database ☆30 · Updated 4 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition ☆46 · Updated 5 years ago
- Speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN ☆35 · Updated 7 years ago
- PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and … ☆148 · Updated 5 years ago
- PyTorch implementation of a Time Delay Neural Network (TDNN) ☆41 · Updated 6 years ago
- Implementation of state of the art d-vector approach for speaker verification ☆127 · Updated 7 years ago
- A PyTorch implementation of xvector embedding ☆79 · Updated 5 years ago
- Python code to extract MFCC and FBANK speech features for Kaldi ☆66 · Updated 6 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e… ☆64 · Updated 5 years ago
- ☆54 · Updated 6 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning. ☆22 · Updated 6 years ago
- PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification" ☆103 · Updated 6 years ago