foamliu / Speech-Transformer

PyTorch re-implementation of Speech-Transformer

☆101

Alternatives and similar repositories for Speech-Transformer

Users who are interested in Speech-Transformer are comparing it to the repositories listed below.

by2101 / OpenASR
A PyTorch-based end-to-end speech recognition system.
☆115 Updated 4 years ago
oshindow / Transformer-Transducer
A pytorch_lightning reimplementation of the Transducer module from ESPnet.
☆77 Updated 4 years ago
ZhengkunTian / Speech-Tranformer-Pytorch
Seq2Seq Speech Recognition with Transformer on Mandarin Chinese
☆116 Updated 5 years ago
biyoml / End-to-End-Mandarin-ASR
End-to-end speech recognition on AISHELL dataset.
☆32 Updated 3 years ago
xingchensong / Speech-Transformer-tf2.0
Transformer for an ASR system (via TensorFlow 2.0)
☆114 Updated 6 years ago
eastonYi / end-to-end_asr_pytorch
Implementations of CTC, Speech-Transformer and CIF for end-to-end speech recognition with PyTorch
☆23 Updated 5 years ago
foamliu / Listen-Attend-Spell-v2
PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).
☆38 Updated 6 years ago
ZhengkunTian / rnn-transducer
A PyTorch implementation of the Transducer model for end-to-end speech recognition
☆236 Updated 5 years ago
tencent-ailab / 3m-asr
3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
☆118 Updated 3 years ago
HawkAaron / E2E-ASR
PyTorch Implementations for End-to-End Automatic Speech Recognition
☆126 Updated 6 years ago
Xiaoxiaohuangg / LAS-Chinese-pytorch
Listen, Attend and Spell - PyTorch Implementation
☆17 Updated 6 years ago
upskyy / Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆108 Updated 3 years ago
kaituoxu / Listen-Attend-Spell
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
☆202 Updated 6 years ago
celebrity-audio-collection / videoprocess
CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.
☆74 Updated 5 years ago
burchim / EfficientConformer
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
☆216 Updated 2 years ago
charlesliucn / awesome-end2end-speech-recognition
💬 A list of end-to-end speech recognition resources, including papers, code and other materials
☆52 Updated 6 years ago
zw76859420 / ASR_WORD
Builds an acoustic model with an end-to-end approach, using characters as the modeling unit and a DCNN-CTC network architecture.
☆70 Updated 6 years ago
cvqluu / Factorized-TDNN
PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and …
☆147 Updated 5 years ago
shiyuzh2007 / ASR
☆55 Updated 5 years ago
R1ckShi / AESRC2020
[ICASSP2021] Data preparation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…
☆55 Updated 4 years ago
jackaduma / LAS_Mandarin_PyTorch
Listen, Attend and Spell model and a pretrained Mandarin Chinese model (Mandarin Chinese ASR model)
☆124 Updated 2 years ago
xcmyz / Transformer-TTS
TTS model based on Transformer.
☆58 Updated 6 years ago
HawkAaron / RNN-Transducer
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
☆139 Updated 4 years ago
ZitengWang / python_kaldi_features
Python code to extract MFCC and FBANK speech features for Kaldi
☆66 Updated 6 years ago
DemisEom / RNNT-pytorch
Implementation of RNN Transducer
☆23 Updated 6 years ago
sonos / keyword-spotting-research-datasets
☆127 Updated 4 years ago
wenet-e2e / WeTextProcessing.deprecated
☆61 Updated 2 years ago
mdangschat / ctc-asr
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
☆122 Updated 5 years ago
funcwj / ge2e-speaker-verification
PyTorch implementation of "Generalized End-to-End Loss for Speaker Verification"
☆103 Updated 6 years ago
Sundy1219 / eesen-for-thchs30
ASR for Chinese Mandarin
☆75 Updated 7 years ago