Luka0612 / asr_transformerLinks

采用transformer end2end训练语音识别ASR

☆0

Alternatives and similar repositories for asr_transformer

Users that are interested in asr_transformer are comparing it to the libraries listed below

Sorting:

yangxueruivs / DFSMN
Tensorflow version of DFSMN
☆49Updated 6 years ago
zw76859420 / ASR_WORD
采用端到端方法构建声学模型，以字为建模单元，采用DCNN-CTC网络结构。
☆70Updated 6 years ago
shiyuzh2007 / ASR
☆55Updated 5 years ago
tzyll / kaldi
☆106Updated 4 years ago
sailordiary / fsmn
Implementations for FSMN (Feedforward Sequential Memory Network), cFSMN, DFSMN, and PFSMN units
☆9Updated 6 years ago
Xiaoxiaohuangg / LAS-Chinese-pytorch
Listen, Attend and Spell - PyTorch Implementation
☆17Updated 6 years ago
zw76859420 / ASR_Phone
以音素建模构建NN-CTC声学模型
☆15Updated 6 years ago
ZhengkunTian / Speech-Tranformer-Pytorch
Seq2Seq Speech Recognition with Transformer on Mandarin Chinese
☆116Updated 5 years ago
mobvoi / lstm_ctc
LSTM CTC End2End Speech Recognition.
☆38Updated 6 years ago
oshindow / Transformer-Transducer
A pytorch_lightning reimplementation of the Transducer module from ESPnet.
☆77Updated 4 years ago
by2101 / OpenASR
A pytorch based end2end speech recognition system.
☆114Updated 4 years ago
eastonYi / end-to-end_asr_pytorch
Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch
☆22Updated 4 years ago
HawkAaron / RNN-Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
☆139Updated 4 years ago
Diamondfan / CTC_pytorch
CTC end -to-end ASR for timit and 863 corpus.
☆218Updated 5 years ago
Sundy1219 / eesen-for-thchs30
ASR for Chinese Mandarin
☆75Updated 7 years ago
jingyonghou / RPN_KWS
Region proposal network based small-footprint keyword spotting (Pytorch)
☆55Updated last year
jx1100370217 / ASR_dosmono
Automatic Speech Recognition with TensorFlow(CNN+BLSTM+CTC)
☆12Updated 6 years ago
EliasCai / speech_recognition_ctc
Use ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别
☆43Updated 6 years ago
idiap / CNN_QbE_STD
Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"
☆32Updated 6 years ago
R1ckShi / AESRC2020
[ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…
☆55Updated 4 years ago
funcwj / kaldi-python-io
A python IO interface for data accessing in kaldi
☆39Updated 4 years ago
RicherMans / PLDA
An LDA/PLDA estimator using KALDI in python for speaker verification tasks
☆100Updated 8 years ago
Magic-Bubble / SpeechProcessForMachineLearning
用于机器学习的语音特征提取，包含FBank和MFCC等，原理讲解和step by step的实现
☆52Updated 6 years ago
lenovo-voice / THE-2020-PERSONALIZED-VOICE-TRIGGER-CHALLENGE-BASELINE-SYSTEM
☆51Updated 4 years ago
786440445 / ASR-with-DFCNN-and-Transformer
Speech Recognition with DFCNN and Transformer
☆18Updated 2 years ago
0three / Speech-Denoise-With-Feature-Loss
本项目使用中文人声的数据集，在Speech Denoising with Deep Feature Losses网络的基础上fine-tune，得到对中文音频有更好去噪效果的结果
☆27Updated 5 years ago
BoragoCode / AttentionBasedProsodyPrediction
Encoder and Decoder and Attention Based Prosody Prediction
☆68Updated 7 years ago
BUTSpeechFIT / x-vector-kaldi-tf
Tensorflow implementation of x-vector topology on top of Kaldi recipe
☆119Updated 5 years ago
wangkenpu / rsrgan
Robust Speech Recognition Using Generative Adversarial Networks (GAN)
☆59Updated 5 years ago
xiangxyq / minimize-chain-decoder
Minimize kaldi nnet3 chain decoder
☆45Updated 5 years ago