charlesliucn / awesome-end2end-speech-recognition

💬 A list of end-to-end speech recognition resources, including papers, code, and other materials

☆52

Alternatives and similar repositories for awesome-end2end-speech-recognition

Users that are interested in awesome-end2end-speech-recognition are comparing it to the libraries listed below

wangkenpu / rsrgan
Robust Speech Recognition Using Generative Adversarial Networks (GAN)
☆59 Updated 6 years ago
ZitengWang / python_kaldi_features
Python code to extract MFCC and FBANK speech features for Kaldi
☆67 Updated 7 years ago
JRMeyer / multi-task-kaldi
An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…
☆55 Updated 5 years ago
xingchensong / Speech-Transformer-tf2.0
Transformer for an ASR system (via TensorFlow 2.0)
☆114 Updated 6 years ago
swshon / voxceleb-ivector
Voxceleb1 i-vector based speaker recognition system
☆44 Updated 7 years ago
R1ckShi / AESRC2020
[ICASSP2021] Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…
☆56 Updated 5 years ago
cvqluu / Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and …
☆149 Updated 5 years ago
RicherMans / PLDA
An LDA/PLDA estimator using Kaldi in Python for speaker verification tasks
☆102 Updated 8 years ago
rwth-i6 / returnn-experiments
experiments with RETURNN
☆161 Updated 3 months ago
HawkAaron / RNN-Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
☆139 Updated 4 years ago
FlorianKrey / DNC
Discriminative Neural Clustering for Speaker Diarisation
☆79 Updated 3 years ago
mdangschat / ctc-asr
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
☆123 Updated 5 years ago
HawkAaron / E2E-ASR
PyTorch Implementations for End-to-End Automatic Speech Recognition
☆127 Updated 6 years ago
shiyuzh2007 / ASR
☆55 Updated 5 years ago
funcwj / ge2e-speaker-verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification"
☆103 Updated 6 years ago
MarkWuNLP / SemanticMask
This repo contains our code for "Semantic Mask for Transformer based End-to-End Speech Recognition"
☆39 Updated 5 years ago
idiap / pkwrap
A PyTorch wrapper for LF-MMI training and parallel training in Kaldi
☆73 Updated 3 years ago
Dannynis / xvector_pytorch
A PyTorch implementation of x-vector embeddings
☆79 Updated 5 years ago
BUTSpeechFIT / x-vector-kaldi-tf
TensorFlow implementation of the x-vector topology on top of a Kaldi recipe
☆120 Updated 6 years ago
cageyoko / CTC-Attention-Mispronunciation
A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques
☆63 Updated 4 years ago
naxingyu / interactive_e2e_speech_recognition
☆38 Updated 5 years ago
1ytic / warp-rna
Recurrent Neural Aligner
☆51 Updated 5 years ago
oshindow / Transformer-Transducer
A pytorch_lightning reimplementation of the Transducer module from ESPnet.
☆78 Updated 4 years ago
bjfu-ai-institute / speaker-recognition-papers
Share some recent speaker recognition papers and their implementations.
☆90 Updated 6 years ago
RaviSoji / plda
Probabilistic Linear Discriminant Analysis & classification, written in Python.
☆131 Updated 3 years ago
SiddGururani / Pytorch-TDNN
☆99 Updated 8 years ago
staplesinLA / denoising_DIHARD18
☆60 Updated 5 years ago
athena-team / athena-decoder
☆76 Updated 3 years ago
jzlianglu / pykaldi2
Yet another speech toolkit based on Kaldi and PyTorch
☆173 Updated 5 years ago
Jamiroquai88 / VBDiarization
Speaker diarization based on Kaldi x-vectors, tuned for 16k microphone data
☆96 Updated 2 years ago