mravanelli / pytorch_MLP_for_ASR

This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and decoding are performed with Kaldi. The current implementation supports dropout and batch normalization. An example for phoneme recognition using the standard TIMIT dataset is provided.

☆38

Alternatives and similar repositories for pytorch_MLP_for_ASR

Users who are interested in pytorch_MLP_for_ASR are comparing it to the repositories listed below.

hirofumi0810 / asr_preprocessing
Python implementation of pre-processing for End-to-End speech recognition
☆69 Updated 7 years ago
swshon / voxceleb-ivector
Voxceleb1 i-vector based speaker recognition system
☆43 Updated 7 years ago
PiotrTa / Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.
☆36 Updated 5 years ago
mravanelli / pySpeechRev
This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…
☆95 Updated 5 years ago
JRMeyer / multi-task-kaldi
An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…
☆54 Updated 5 years ago
staplesinLA / denoising_DIHARD18
☆60 Updated 4 years ago
wangkenpu / rsrgan
Robust Speech Recognition Using Generative Adversarial Networks (GAN)
☆59 Updated 5 years ago
bsxfan / meta-embeddings
Meta-embeddings are a probabilistic generalization of embeddings in machine learning.
☆22 Updated 6 years ago
qqueing / SR_with_kaldi
Speaker embedding (verification and recognition) using TensorFlow with Kaldi
☆41 Updated 7 years ago
sid0710 / audio_data_augmentation
☆26 Updated 7 years ago
bootphon / abkhazia
ABX and kaldi experiments on speech corpora made easy
☆32 Updated 9 months ago
Kyubyong / specAugment
Tensor2tensor experiment with SpecAugment
☆46 Updated 6 years ago
jaideeppatel / Training-Targets-for-Speech-Separation-Neural-Networks
This is a project on working/resolving the speech separation problem using supervised learning on various training targets, building mach…
☆34 Updated 8 years ago
swshon / multi-speakerID
☆30 Updated 6 years ago
kan-bayashi / INTERSPEECH19_TUTORIAL
Interspeech 2019 tutorial materials
☆48 Updated 5 years ago
OrcusCZ / NNAcousticModeling
☆24 Updated 6 years ago
charlesliucn / awesome-end2end-speech-recognition
💬 A list of end-to-end speech recognition resources, including papers, code, and other materials
☆52 Updated 6 years ago
liyongze / lstm_speaker_verification
☆35 Updated 6 years ago
qqueing / speaker_embedding-pytorch
"An Improved Deep Embedding Learning Method for Short Duration Speaker Verification" pytorch implementation
☆19 Updated 6 years ago
SiddGururani / Pytorch-TDNN
☆99 Updated 7 years ago
lifelongeek / AAS_enhancement
This repository contains the code and supplementary result for the paper "Unpaired Speech Enhancement by Acoustic and Adversarial Supervi…
☆28 Updated 5 years ago
funcwj / deep-clustering
Deep clustering method for single-channel speech separation
☆109 Updated 3 years ago
AppleHolic / PytorchSR
Pytorch based phoneme recognition (TIMIT phoneme classification)
☆34 Updated 7 years ago
jfsantos / irasl2018
Code for the paper "Investigating the effect of residual and highway connections in speech enhancement models"
☆11 Updated 6 years ago
ZitengWang / python_kaldi_features
Python code to extract MFCC and FBANK speech features for Kaldi
☆66 Updated 6 years ago
joonson / voxsrc_2019
VoxSRC Challenge
☆31 Updated 6 years ago
YiwenShaoStephen / pychain_example
☆48 Updated 4 years ago
JaesungBae / Speech-Command-Recognition-with-Capsule-Network
Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.
☆25 Updated 6 years ago
ShigekiKarita / espnet-semi-supervised
ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…
☆38 Updated 5 years ago
rafaelvalle / asrgen
Attacking Speaker Recognition with Deep Generative Models
☆34 Updated 2 years ago