whull / end2end_ASRLinks
端到端语音识别实现;包含LAS、CTC、RNNT解码方式,模型SA(MHA)、LSTM、CNN、DFSMN等
☆15Updated 4 years ago
Alternatives and similar repositories for end2end_ASR
Users that are interested in end2end_ASR are comparing it to the libraries listed below
Sorting:
- Went online decode demo☆31Updated 4 years ago
- 基于单语种语料的中英混合语音识别算法-同花顺算法挑战赛-2021年9-10月双月赛☆14Updated 3 years ago
- 以音素建模构建NN-CTC声学模型☆15Updated 6 years ago
- ☆10Updated last year
- ☆40Updated 3 years ago
- ☆13Updated 4 years ago
- 本项目使用中文人声的数据集,在Speech Denoising with Deep Feature Losses网络的基础上fine-tune,得到对中文音频有更好去噪效果的结果☆28Updated 5 years ago
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆13Updated 4 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆41Updated 2 years ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆42Updated 2 years ago
- kaldi cnn-tdnnf baseline☆13Updated 3 years ago
- Addressing Text-dependent Speaker Verification Using Singing Speech☆9Updated 6 years ago
- tacotron-2(pytorch) + melgan(pytorch) chinese TTS☆26Updated 2 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- Detect emotion from audio☆13Updated 6 years ago
- Keyword Search Recipe for Subword ASR☆30Updated 6 years ago
- 基于随机森林和条件随机场的中文韵律预测模型☆28Updated last year
- ☆32Updated 4 years ago
- End-to-end speech recognition on AISHELL dataset.☆32Updated 3 years ago
- it's ASR decoder and make graph project☆33Updated 3 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆17Updated 3 years ago
- 基于Kaldi的小词汇量汉语语音识别,使用DNN训练☆27Updated 6 years ago
- EfficientNet-Absolute Zero for Continuous Speech Keyword Spotting☆23Updated 3 years ago
- c++ code for merlin tts☆22Updated 5 years ago
- magicspeech competition recipe☆18Updated 5 years ago
- simple dnn based vad☆70Updated 6 years ago
- Attention-based model for keywords spotting☆19Updated 4 years ago
- Keyword Spotting for detecting a word in an audio file☆17Updated 6 years ago
- DNN and RCED speech enhancement☆19Updated last year
- One command to build TLG.fst for WeNet.☆31Updated 2 years ago