glynpu / asr_abc
中文语音识别,automatic speech recognition(ASR)
☆11Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for asr_abc
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Updated last year
- E2E ASR system☆14Updated 2 years ago
- 基于单语种语料的中英混合语音识别算法-同花顺算法挑战赛-2021年9-10月双月赛☆14Updated 3 years ago
- kaldi cnn-tdnnf baseline☆13Updated 3 years ago
- Went online decode demo☆29Updated 3 years ago
- Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"☆15Updated 3 years ago
- 端到端语音识别实现;包含LAS、CTC、RNNT解码方式,模型SA(MHA)、LSTM、CNN、DFSMN等☆14Updated 3 years ago
- ☆13Updated last year
- ☆13Updated 3 years ago
- repository for paper "Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis"☆15Updated 2 years ago
- An Automatic Speech Recognition using GMM & HMM.☆18Updated 5 years ago
- ☆15Updated 2 years ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆14Updated 3 months ago
- The project for speech translation☆11Updated last year
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆12Updated 3 years ago
- ☆10Updated last year
- ☆11Updated 3 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated last year
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆38Updated last year
- ☆13Updated 2 years ago
- PHO-LID: A Unified Model to Incorporate Acoustic-Phonetic and Phonotactic Information for Language Identification☆19Updated last year
- This repository contains code for applying Data2Vec to pretrain Keyword Transformer model as described in "Improving Label-Deficient Keyw…☆26Updated 2 months ago
- it's ASR decoder and make graph project☆32Updated 2 years ago
- Goodness of Pronunciation algorithm using PyKaldi☆14Updated 2 years ago
- ☆21Updated 2 weeks ago
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆20Updated last month
- A STFT/iSTFT written up in PyTorch using 1D Convolutions☆25Updated 4 months ago
- Source Code for the Paper "UNIFIED KEYWORD SPOTTING AND AUDIO TAGGING ON MOBILE DEVICES WITH TRANSFORMERS"☆23Updated last year
- A simple command line tool to calculate WER for ASR.☆13Updated last month