zhaoyu611 / Automatic_Speech_Recognition_with_Multi_Models
A simple Automatic Speech Recognition (ASR) model in TensorFlow that lets you focus only on the deep neural network. It is easy to test popular cells (mostly LSTM and its variants) and models (unidirectional RNN, bidirectional RNN, ResNet, and so on). You are also welcome to experiment with self-defined cells or models.
☆19 Updated 7 years ago
Alternatives and similar repositories for Automatic_Speech_Recognition_with_Multi_Models:
Users who are interested in Automatic_Speech_Recognition_with_Multi_Models are comparing it to the libraries listed below:
- System for identifying speaker from given speech signal using MFCC, LPC features and Gaussian Mixture Models ☆21 Updated 7 years ago
- ☆60 Updated 4 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN) ☆59 Updated 5 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a… ☆95 Updated 4 years ago
- ☆35 Updated 5 years ago
- Speaker embedding (verification and recognition) using Tensorflow with Kaldi ☆41 Updated 7 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition ☆45 Updated 4 years ago
- This is a project on working/resolving the speech separation problem using supervised learning on various training targets, building mach… ☆34 Updated 7 years ago
- This Repository includes four different implementations of the Speaker Verification task including the GMM_UBM, Ivector, Deep-Speaker, an… ☆32 Updated 6 years ago
- ☆99 Updated 7 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec… ☆38 Updated 7 years ago
- Estimate the number of concurrent speakers from single channel mixtures to crack the "cocktail-party" problem. ☆22 Updated 5 years ago
- Python implementation of pre-processing for End-to-End speech recognition ☆69 Updated 7 years ago
- Voxceleb1 i-vector based speaker recognition system ☆43 Updated 6 years ago
- This is a working example of using CTC for phone recognition on TIMIT ☆50 Updated 7 years ago
- A set of speech feature extraction functions for ASR and speaker identification written in matlab. ☆43 Updated 8 years ago
- End to End Dialect Identification using Convolutional Neural Network ☆52 Updated 5 years ago
- Develop speaker recognition model based on i-vector using TIMIT database ☆16 Updated 5 years ago
- A tensorflow implementation of my paper Combining beamforming and deep neural networks for multi-channel speech extraction ☆62 Updated 4 years ago
- VoxCeleb plugin for pyannote.database ☆29 Updated 3 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf). ☆31 Updated 5 years ago
- Denoise Speech (Enhanced Speech or Speech enhancement) by Deep Learning (Using Keras and Tensorflow) ☆39 Updated 7 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification ☆19 Updated 5 years ago
- about Speech enhancement ☆33 Updated 6 years ago
- Tensorflow implementation for Speech Enhancement (DDAE) ☆48 Updated 6 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi ☆65 Updated 6 years ago
- Feature extraction of speech signal is the initial stage of any speech recognition system. ☆92 Updated 4 years ago
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition. ☆21 Updated 8 years ago
- An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai… ☆54 Updated 5 years ago
- A pytorch implementation of xvector embedding ☆79 Updated 4 years ago