zhaoyu611 / Automatic_Speech_Recognition_with_Multi_ModelsLinks

A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popular cells (most are LSTM and its variants) and models (unidirectioanl RNN, bidirectional RNN, ResNet and so on). Moreover, you are welcome to play with self-defined cells or models.

☆19

Alternatives and similar repositories for Automatic_Speech_Recognition_with_Multi_Models

Users that are interested in Automatic_Speech_Recognition_with_Multi_Models are comparing it to the libraries listed below

Sorting:

mravanelli / pytorch_MLP_for_ASR
This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…
☆38Updated 7 years ago
jinserk / pytorch-asr
ASR with PyTorch
☆139Updated 6 years ago
hirofumi0810 / asr_preprocessing
Python implementation of pre-processing for End-to-End speech recognition
☆69Updated 7 years ago
SiddGururani / Pytorch-TDNN
☆99Updated 7 years ago
wangkenpu / rsrgan
Robust Speech Recognition Using Generative Adversarial Networks (GAN)
☆59Updated 5 years ago
qqueing / SR_with_kaldi
Speaker embedding(verification and recognition) using Tensorflow with Kaldi
☆41Updated 7 years ago
aishoot / Speech_Feature_Extraction
Feature extraction of speech signal is the initial stage of any speech recognition system.
☆93Updated 4 years ago
staplesinLA / denoising_DIHARD18
☆60Updated 4 years ago
tbornt / phoneme_ctc
Bidirectional dynamic RNN + CTC for phoneme recognition
☆46Updated 5 years ago
fernandodelacalle / ResNet-Kaldi-Tensorflow-ASR
Code for the paper: Deep Residual Networks with Auditory Inspired Features for Robust Speech Recognition.
☆21Updated 8 years ago
rajathkmp / speaker-verification
Implementation of state of the art d-vector approach for speaker verification
☆127Updated 7 years ago
swshon / voxceleb-ivector
Voxceleb1 i-vector based speaker recognition system
☆43Updated 7 years ago
auspicious3000 / WaveNet-Enhancement
Speech Enhancement using Bayesian WaveNet
☆96Updated 7 years ago
mravanelli / pySpeechRev
This python code performs an efficient speech reverberation starting from a dataset of close-talking speech signals and a collection of a…
☆95Updated 5 years ago
xingchensong / Speech-Transformer-tf2.0
transformer for ASR-systerm (via tensorflow2.0)
☆114Updated 6 years ago
genzen2103 / Speaker-Recognition-System-using-GMM
System for identifying speaker from given speech signal using MFCC,LPC features and Gaussian Mixture Models
☆21Updated 7 years ago
Suhee05 / Text-Independent-Speaker-Verification
Text Independent Speaker Verification Using GE2E Loss
☆84Updated 6 years ago
weedwind / CTC-speech-recognition
This is a working example of using CTC for phone recognition on TIMIT
☆50Updated 7 years ago
PengdaLiu / LAS-SpeechRecognition
Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).
☆32Updated 6 years ago
jameslyons / matlab_speech_features
A set of speech feature extraction functions for ASR and speaker identification written in matlab.
☆43Updated 8 years ago
cvqluu / Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and …
☆147Updated 5 years ago
funcwj / deep-clustering
deep clustering method for single-channel speech separation
☆109Updated 3 years ago
charlesliucn / awesome-end2end-speech-recognition
💬 A list of End-to-End speech recognition, including papers, codes and other materials
☆52Updated 6 years ago
liyongze / lstm_speaker_verification
☆35Updated 6 years ago
GauravWaghmare / Speaker-Identification
A program for automatic speaker identification using deep learning techniques.
☆84Updated 8 years ago
swshon / dialectID_e2e
End to End Dialect Identification using Convolutional Neural Network
☆52Updated 5 years ago
snsun / pit-speech-separation
☆130Updated 6 years ago
zhr1201 / Multi-channel-speech-extraction-using-DNN
A tensorflow implementation of my paper Combining beamforming and deep neural networks for multi-channel speech extraction
☆64Updated 4 years ago
cvqluu / TDNN
Time delay neural network (TDNN) implementation in Pytorch using unfold method
☆202Updated 5 years ago
PiotrTa / Huawei-Challenge-Speaker-Identification
Trained speaker embedding deep learning models and evaluation pipelines in pytorch and tesorflow for speaker recognition.
☆36Updated 5 years ago