mravanelli / pytorch_MLP_for_ASR
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.
☆38Updated 7 years ago
Alternatives and similar repositories for pytorch_MLP_for_ASR:
Users that are interested in pytorch_MLP_for_ASR are comparing it to the libraries listed below
- Voxceleb1 i-vector based speaker recognition system☆43Updated 6 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 7 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 4 years ago
- Speaker embedding(verification and recognition) using Tensorflow with Kaldi☆41Updated 7 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆59Updated 5 years ago
- ☆60Updated 4 years ago
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆22Updated 6 years ago
- This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…☆28Updated 5 years ago
- VoxCeleb plugin for pyannote.database☆29Updated 3 years ago
- "An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation☆19Updated 6 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…☆54Updated 5 years ago
- Pytorch based phoneme recognition (TIMIT phoneme classification)☆34Updated 6 years ago
- This is a project on working/resolving the speech separation problem using supervised learning on various training targets, building mach…☆34Updated 7 years ago
- Interspeech 2019 tutorial materials☆48Updated 5 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆65Updated 6 years ago
- ☆38Updated 4 years ago
- Keras-based python framework to compute phonological posterior probabilities from audio files☆41Updated 2 years ago
- ☆35Updated 6 years ago
- This repository is an extension of GAN based speech enhancement called SEGAN, and we present two modifications to make model training mor…☆37Updated 2 years ago
- A set of speech feature extraction functions for ASR and speaker identification written in matlab.☆43Updated 8 years ago
- ABX and kaldi experiments on speech corpora made easy☆32Updated 6 months ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆64Updated 5 years ago
- Filtering and Noise Adding Tool☆29Updated 2 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- Speech separation with utterance-level PIT experiments☆104Updated 6 years ago
- Inspired work by the project of SER using ELM at Microsoft Research☆19Updated 6 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Updated 5 years ago
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆54Updated last year
- ☆48Updated 4 years ago
- VoxSRC Challenge☆31Updated 5 years ago