whull / end2end_ASR
End-to-end speech recognition implementation; includes LAS, CTC, and RNNT decoding methods, with SA (MHA), LSTM, CNN, DFSMN, and other models
☆15Updated 4 years ago
Alternatives and similar repositories for end2end_ASR
Users that are interested in end2end_ASR are comparing it to the libraries listed below
- WeNet online decoding demo☆31Updated 4 years ago
- ☆40Updated 4 years ago
- Building an NN-CTC acoustic model with phoneme-based modeling☆15Updated 6 years ago
- ☆33Updated 4 years ago
- ☆13Updated 4 years ago
- Chinese prosody prediction model based on random forests and conditional random fields☆28Updated last year
- ☆12Updated last year
- ☆15Updated 5 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆18Updated 3 years ago
- Inverse normalization of normalized Chinese text; specifically, an inverse version of chinese_text_normalization.☆13Updated 4 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆42Updated 3 years ago
- kaldi cnn-tdnnf baseline☆13Updated 4 years ago
- [Tiny KWS] SparkNet: Sparse Binarization for Fast Keyword Spotting☆16Updated 4 months ago
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆43Updated 2 years ago
- Attention-Enhanced Short-Time Wiener Solution for Acoustic Echo Cancellation☆22Updated 2 months ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Updated 3 years ago
- Speech command recognition with capsule network & various NNs / KWS on Google Speech Command Dataset.☆25Updated 6 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆18Updated 3 years ago
- An upgraded framework for training and validation, compared with icefall, using Lightning.☆13Updated 9 months ago
- A code library for training acoustic models☆27Updated 5 years ago
- ☆33Updated 4 years ago
- ☆22Updated 6 years ago
- One command to start a streaming ASR server.☆12Updated last year
- Detect emotion from audio☆13Updated 7 years ago
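- A collection of resources on speech technologies: speech recognition, speech front-end processing, speech synthesis, voice conversion, and more☆22Updated 6 years ago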
- An ASR decoder and graph-building project☆33Updated 3 years ago
- ☆11Updated 2 years ago
- A SPMI Lab toolkit for language models.☆11Updated 8 years ago
- Mining effective negative training samples for keyword spotting (PyTorch)☆64Updated 5 years ago
- Voice conversion model for real-time speech synthesis using PPG (Phonetic PosteriorGram) as an intermediate feature, written in PyTorch.☆28Updated 3 years ago