Luka0612 / asr_transformer
采用transformer end2end训练语音识别ASR
☆0Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for asr_transformer
- ☆107Updated 3 years ago
- Tensorflow version of DFSMN☆48Updated 6 years ago
- 采用端到端方法构建声学模型,以字为建模单元,采用DCNN-CTC网络结构。☆71Updated 5 years ago
- Implementations for FSMN (Feedforward Sequential Memory Network), cFSMN, DFSMN, and PFSMN units☆9Updated 6 years ago
- ☆55Updated 4 years ago
- 基于dVector的说话人识别keras☆87Updated 3 years ago
- An LDA/PLDA estimator using KALDI in python for speaker verification tasks☆99Updated 7 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Updated 5 years ago
- A pytorch_lightning reimplementation of the Transducer module from ESPnet.☆75Updated 3 years ago
- Minimize kaldi nnet3 chain decoder☆45Updated 4 years ago
- MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks☆136Updated 3 years ago
- 以音素建模构建NN-CTC声学模型☆15Updated 5 years ago
- ☆142Updated 4 years ago
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆22Updated 4 years ago
- Use ctc to do chinese speech recognition by keras / 通过keras和ctc实现中文语音识别☆43Updated 6 years ago
- ASR for Chinese Mandarin☆75Updated 6 years ago
- python codes to extract MFCC and FBANK speech features for Kaldi☆63Updated 5 years ago
- LSTM CTC End2End Speech Recognition.☆38Updated 5 years ago
- Automatic Speech Recognition with TensorFlow(CNN+BLSTM+CTC)☆12Updated 6 years ago
- Seq2Seq Speech Recognition with Transformer on Mandarin Chinese☆115Updated 4 years ago
- 本项目使用中文人声的数据集,在Speech Denoising with Deep Feature Losses网络的基础上fine-tune,得到对中文音频有更好去噪效果的结果☆26Updated 5 years ago
- The Implementation of FastSpeech2 Based on Pytorch.☆52Updated last year
- [ICASSP2021] Data preperation scripts, training pipeline and baseline experiment results for the Interspeech 2020 Accented English Speech…☆55Updated 4 years ago
- 基于卷积神经网络的语音识别声学模型的研究☆171Updated 5 years ago
- 用于机器学习的语音特征提取,包含FBank和MFCC等,原理讲解和step by step的实现☆50Updated 5 years ago
- py-webrtcvad wrapper for trimming speech clips☆47Updated 2 years ago
- A librosa STFT/Fbank/mfcc feature extration written up in PyTorch using 1D Convolutions.☆73Updated 2 years ago
- Implementation of the work presented in "CNN based Query by Example Spoken Term Detection"☆32Updated 6 years ago