语音识别 论文 前沿
☆51Jan 8, 2022Updated 4 years ago
Alternatives and similar repositories for ASR_awesome
Users that are interested in ASR_awesome are comparing it to the libraries listed below
Sorting:
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- ASR教程: https://dataxujing.github.io/ASR-paper/☆25Jul 1, 2024Updated last year
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- An upgrade framework for train and validate compare with icefall using Lightning.☆15Mar 26, 2025Updated 11 months ago
- Minimize kaldi nnet3 chain decoder☆45Jan 10, 2020Updated 6 years ago
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- WikiQA,复现论文《Multihop Atention Networks for Qestion Answer Matching》☆11Mar 25, 2019Updated 6 years ago
- ☆13Sep 12, 2017Updated 8 years ago
- ☆29Aug 8, 2024Updated last year
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)☆33May 17, 2024Updated last year
- End-to-end MOdeling of ASR (Automatic Speech Recognition)☆33Feb 16, 2023Updated 3 years ago
- ☆15Aug 30, 2022Updated 3 years ago
- Text Matching Based on LCQMC: A Large-scale Chinese Question Matching Corpus☆15Jan 12, 2021Updated 5 years ago
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.☆16Nov 5, 2020Updated 5 years ago
- Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)☆16May 8, 2022Updated 3 years ago
- ☆22Jul 8, 2019Updated 6 years ago
- Deep model with built-in self-attention alignment for acoustic echo cancellation, Pytorch implement☆39Jul 25, 2023Updated 2 years ago
- Automatic Speech Recognition with TensorFlow(CNN+BLSTM+CTC)☆12Aug 9, 2018Updated 7 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- Pytorch Bindings for warp-ctc maintained by ESPnet☆17Feb 20, 2021Updated 5 years ago
- ☆16Jun 13, 2022Updated 3 years ago
- hmm是实现命名实体识别,python 实现,对2014的人民日报语料进行按字切分,统计初始、转换、发射概率☆16Oct 19, 2017Updated 8 years ago
- keras encoder-decoder☆17Apr 3, 2018Updated 7 years ago
- End-to-End Speech Processing Toolkit☆15Jan 20, 2025Updated last year
- asr2k☆52Jun 2, 2024Updated last year
- Keyword spotting for audio with attention (KWS model for audio)☆18Jul 15, 2021Updated 4 years ago
- 从Kaldi中裁剪的轻量级语音识别解码推理框架,目前实现了MFCC+GMM+Viterbi,不依赖OpenFST、OpenBLAS等库☆22Jul 31, 2021Updated 4 years ago
- Finetune Wa2vec 2.0 For Speech Recognition☆146Feb 6, 2025Updated last year
- Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem☆98May 30, 2025Updated 9 months ago
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆23Jul 28, 2020Updated 5 years ago
- List of Large Lanugage Model Papers☆60Jun 5, 2023Updated 2 years ago
- The case study and multilingfual performance of ICASSP submission☆24Sep 24, 2022Updated 3 years ago
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆24Oct 11, 2024Updated last year
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- react版本的labelImage☆11Oct 26, 2021Updated 4 years ago
- PyTorch toolkit for streaming speech recognition, speech translation and simultaneous translation based on fairseq.☆25Oct 3, 2022Updated 3 years ago
- Production first, nn-based on-device signal processing toolkit.☆65May 30, 2023Updated 2 years ago
- E2E system with LF-MMI; word N-gram for Mandarin☆166Apr 29, 2022Updated 3 years ago