whull / end2end_ASR
端到端语音识别实现;包含LAS、CTC、RNNT解码方式,模型SA(MHA)、LSTM、CNN、DFSMN等
☆15Updated 3 years ago
Alternatives and similar repositories for end2end_ASR:
Users that are interested in end2end_ASR are comparing it to the libraries listed below
- 基于单语种语料的中英混合语音识别算法-同花顺算法挑战赛-2021年9-10月双月赛☆14Updated 3 years ago
- Went online decode demo☆29Updated 3 years ago
- 以音素建模构建NN-CTC声学模型☆15Updated 5 years ago
- ☆37Updated 3 years ago
- 分享在深蓝学院《语音识别:从入门到精通》第一期课程学习过程中完成的课后作业,供参考。☆21Updated 4 years ago
- kaldi cnn-tdnnf baseline☆13Updated 3 years ago
- 基于随机森林和条件随机场的中文韵律预测模型☆28Updated 9 months ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆16Updated 2 years ago
- tacotron-2(pytorch) + melgan(pytorch) chinese TTS☆26Updated last year
- A library for adding punctuation into a text from ASR.☆17Updated last year
- ☆13Updated 4 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆9Updated 2 years ago
- 宋知用《MATLAB在语音信号分析与合成中的应用》 Python版☆35Updated 3 years ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆41Updated last year
- ☆31Updated 3 years ago
- Keyword spotting for audio with attention (KWS model for audio)☆18Updated 3 years ago
- Utilizes ONNX Runtime for audio denoising.☆44Updated this week
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆13Updated 4 years ago
- An Automatic Speech Recognition using GMM & HMM.☆19Updated 5 years ago
- ☆31Updated 2 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…☆29Updated last month
- Feedforward Sequential Memory Networks☆15Updated 2 years ago
- ASR教程: https://dataxujing.github.io/ASR-paper/☆24Updated 9 months ago
- Rank 7th/1817 in the 2018 iFLYTEK AI Developer Challenge with acc 0.82 for the ten Chinese dialects classification task, this code was p…☆13Updated last year
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆22Updated 4 years ago
- A ctc decoder for both online and offline asr model☆63Updated last year
- it's ASR decoder and make graph project☆32Updated 2 years ago
- 本项目使用中文人声的数据集,在Speech Denoising with Deep Feature Losses网络的基础上fine-tune,得到对中文音频有更好去噪效果的结果☆27Updated 5 years ago
- This is the official implementation of " Enhancing Embeddings for Speech Classification in Noisy Conditions"☆10Updated last year