zhaoyu611 / Automatic_Speech_Recognition_with_Multi_Models
A simple Automatic Speech Recognition (ASR) model in TensorFlow that lets you focus solely on the deep neural network. It is easy to test popular cells (mostly LSTM and its variants) and models (unidirectional RNN, bidirectional RNN, ResNet, and so on). You are also welcome to experiment with self-defined cells or models.
☆19Updated 6 years ago
Related projects
Alternatives and complementary repositories for Automatic_Speech_Recognition_with_Multi_Models
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆37Updated 6 years ago
- Speaker embedding (verification and recognition) using Tensorflow with Kaldi☆41Updated 7 years ago
- ☆59Updated 4 years ago
- Voxceleb1 i-vector based speaker recognition system☆43Updated 6 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆58Updated 4 years ago
- Bidirectional dynamic RNN + CTC for phoneme recognition☆44Updated 4 years ago
- This is a project on working/resolving the speech separation problem using supervised learning on various training targets, building mach…☆34Updated 7 years ago
- This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…☆95Updated 4 years ago
- ☆26Updated 7 years ago
- System for identifying the speaker from a given speech signal using MFCC and LPC features and Gaussian Mixture Models☆21Updated 7 years ago
- Develop speaker recognition model based on i-vector using TIMIT database☆16Updated 5 years ago
- Python implementation of pre-processing for End-to-End speech recognition☆69Updated 6 years ago
- Speech separation with utterance-level PIT experiments☆102Updated 6 years ago
- ☆98Updated 6 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆63Updated 5 years ago
- speech enhancement GAN on waveform/log-power-spectrum data using Improved WGAN☆34Updated 6 years ago
- ☆35Updated 5 years ago
- Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tensorflow for speaker recognition.☆36Updated 5 years ago
- Estimate the number of concurrent speakers from single channel mixtures to crack the "cocktail-party" problem.☆22Updated 4 years ago
- A pytorch implementation of xvector embedding☆78Updated 4 years ago
- Deep Discriminative Embeddings for Duration Robust Speaker Verification☆19Updated 4 years ago
- fast SpecAugmentation code with numpy and scipy☆30Updated 5 years ago
- Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.☆21Updated 7 years ago
- ☆27Updated 6 years ago
- ICASSP2019 Tutorial: Detection and Classification of Acoustic Scenes and Events / Code examples☆41Updated 5 years ago
- python script for voice activity detection.☆34Updated 3 months ago
- Custom decoders for Kaldi☆80Updated 5 years ago
- Pytorch implementation of "Generalized End-to-End Loss for Speaker Verification"☆101Updated 5 years ago
- ☆54Updated 5 years ago