786440445 / ASR-with-DFCNN-and-Transformer
Speech Recognition with DFCNN and Transformer
☆18Updated 2 years ago
Alternatives and similar repositories for ASR-with-DFCNN-and-Transformer:
Users that are interested in ASR-with-DFCNN-and-Transformer are comparing it to the libraries listed below
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆22Updated 4 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Updated 6 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆65Updated 6 years ago
- ☆55Updated 4 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆59Updated 4 years ago
- End-to-end speech recognition on AISHELL dataset.☆31Updated 3 years ago
- ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别☆12Updated 4 years ago
- 采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。☆71Updated 6 years ago
- 以音素建模构建NN-CTC声学模型☆15Updated 5 years ago
- Minimize kaldi nnet3 chain decoder☆45Updated 5 years ago
- Tensorflow version of DFSMN☆49Updated 6 years ago
- Region proposal network based small-footprint keyword spotting (Pytorch)☆54Updated last year
- PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"☆19Updated 5 years ago
- LSTM CTC End2End Speech Recognition.☆38Updated 5 years ago
- Encoder and Decoder and Attention Based Prosody Prediction☆68Updated 7 years ago
- Seq2Seq Speech Recognition with Transformer on Mandarin Chinese☆116Updated 5 years ago
- Use ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别☆43Updated 6 years ago
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- it's a train acoustics model code lib☆26Updated 4 years ago
- ☆50Updated 4 years ago
- Automatic Speech Recognition with TensorFlow(CNN+BLSTM+CTC)☆12Updated 6 years ago
- ASR for Chinese Mandarin☆75Updated 6 years ago
- 用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现☆52Updated 5 years ago
- Robust Speech Recognition Using Generative Adversarial Networks (GAN)☆58Updated 5 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆72Updated 5 years ago
- ☆16Updated 5 years ago
- Listen, Attend and Spell (LAS) framework for speech recognition (see https://arxiv.org/pdf/1508.01211.pdf).☆31Updated 5 years ago
- Implementaion RNN tranceducer☆21Updated 5 years ago
- tacotron-2(pytorch) + melgan(pytorch) chinese TTS☆26Updated last year
- This is a implementation of kaldi-plda.☆15Updated 6 years ago