whull / end2end_ASR
End-to-end speech recognition implementation; includes LAS, CTC, and RNN-T decoding, with models such as SA (MHA), LSTM, CNN, and DFSMN
☆14Updated 3 years ago
Related projects
Alternatives and complementary repositories for end2end_ASR
- WeNet online decoding demo☆29Updated 3 years ago
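- Chinese-English code-switching speech recognition algorithm based on monolingual corpora; Tonghuashun (同花顺) Algorithm Challenge, September-October 2021 bimonthly contest☆14Updated 3 years ago
- Homework assignments completed during the first run of the Shenlan Xueyuan (深蓝学院) course "Speech Recognition: From Beginner to Expert", shared for reference.☆21Updated 4 years ago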
- ☆34Updated 3 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆38Updated last year
- kaldi cnn-tdnnf baseline☆13Updated 3 years ago
- ☆30Updated 3 years ago
- Kaldi-based small-vocabulary Chinese speech recognition, trained with a DNN☆27Updated 5 years ago
- Repository for the paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆15Updated 2 years ago
- Tacotron 2 (PyTorch) + MelGAN (PyTorch) Chinese TTS☆26Updated last year
- Chinese prosody prediction model based on random forests and conditional random fields☆27Updated 3 months ago
- An ASR decoder and graph-building project☆32Updated 2 years ago
- Inverse normalization of normalized Chinese text; specifically, an inverse version of chinese_text_normalization.☆12Updated 3 years ago
- One command to build TLG.fst for WeNet.☆29Updated 2 years ago
- NN-CTC acoustic model built with phoneme-level modeling☆15Updated 5 years ago
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆20Updated 3 weeks ago
- Addressing Text-dependent Speaker Verification Using Singing Speech☆9Updated 5 years ago
- Listen, Attend and Spell - PyTorch Implementation☆17Updated 5 years ago
- A code library for training acoustic models☆26Updated 4 years ago
- ☆13Updated 3 years ago
- ☆31Updated 2 years ago
- Minimize kaldi nnet3 chain decoder☆45Updated 4 years ago
- This project fine-tunes the Speech Denoising with Deep Feature Losses network on a Chinese speech dataset to obtain better denoising results on Chinese audio☆26Updated 4 years ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆38Updated last year
- ☆22Updated 5 years ago
- A library for adding punctuation into a text from ASR.☆17Updated last year
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆25Updated 7 months ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆15Updated 3 years ago
- MagicData-RAMC Dataset and Baseline☆49Updated 2 years ago
- Fundamentals of speech signal processing☆35Updated 5 years ago