TowerYsable/ASR_awesome

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/TowerYsable/ASR_awesome)

TowerYsable / ASR_awesome

语音识别论文前沿

☆53

Alternatives and similar repositories for ASR_awesome

Users that are interested in ASR_awesome are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

burchim / EfficientConformer
View on GitHub
[ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition
☆221Jun 22, 2023Updated 3 years ago
xuchennlp / S2T
View on GitHub
The project for speech translation
☆12Sep 28, 2023Updated 2 years ago
idiap / contextual-biasing-on-gpus
View on GitHub
Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…
☆21Sep 25, 2023Updated 2 years ago
ZQuang2202 / Zipformer_Lightning
View on GitHub
An upgrade framework for train and validate compare with icefall using Lightning.
☆16Mar 26, 2025Updated last year
chutaklee / CantoASR
View on GitHub
Fine-tuning Wav2Vec2.0 on Common Voice(zh-HK)
☆16May 8, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
DataXujing / ASR-paper
View on GitHub
ASR教程: https://dataxujing.github.io/ASR-paper/
☆26Jul 1, 2024Updated 2 years ago
datemoon / tf-code-acoustics
View on GitHub
it's a train acoustics model code lib
☆27May 20, 2020Updated 6 years ago
jihyun300 / Speech-Recognizer
View on GitHub
Construct GMM-HMM and Implement the Viterbi algorithm for continuous speech recognition
☆15Apr 1, 2018Updated 8 years ago
xiangxyq / minimize-chain-decoder
View on GitHub
Minimize kaldi nnet3 chain decoder
☆45Jan 10, 2020Updated 6 years ago
k2-fsa / fast_rnnt
View on GitHub
A torch implementation of a recursion which turns out to be useful for RNN-T.
☆149Aug 25, 2023Updated 2 years ago
TeaPoly / warp-ctc-crf
View on GitHub
An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.
☆12Jul 5, 2021Updated 5 years ago
stevenhillis / awesome-asr-contextualization
View on GitHub
A curated list of awesome papers on contextualizing E2E ASR outputs
☆81May 10, 2023Updated 3 years ago
emonosuke / emoASR
View on GitHub
End-to-end MOdeling of ASR (Automatic Speech Recognition)
☆33Feb 16, 2023Updated 3 years ago
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
snsun / kaldi-decoder-code-reading
View on GitHub
☆33Oct 28, 2022Updated 3 years ago
du-ud / kaldi-cslt
View on GitHub
☆15Aug 30, 2022Updated 3 years ago
danpovey / conditional-flow-matching
View on GitHub
☆29Aug 8, 2024Updated last year
lhwcv / self_attention_alignment
View on GitHub
Deep model with built-in self-attention alignment for acoustic echo cancellation, Pytorch implement
☆39Jul 25, 2023Updated 2 years ago
ZhengkunTian / OpenTransformer
View on GitHub
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
☆378Jul 21, 2022Updated 4 years ago
datemoon / ASR-decoder
View on GitHub
it's ASR decoder and make graph project
☆33May 26, 2022Updated 4 years ago
khanld / ASR-Wav2vec-Finetune
View on GitHub
Finetune Wa2vec 2.0 For Speech Recognition
☆149Feb 6, 2025Updated last year
WenRichard / man_qa
View on GitHub
WikiQA，复现论文《Multihop Atention Networks for Qestion Answer Matching》
☆11Mar 25, 2019Updated 7 years ago
qiujiali / lattice-rescore
View on GitHub
☆16Jun 13, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
YosukeHiguchi / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆16Jan 20, 2025Updated last year
zyascend / End-to-End-Speech-Recognition-Learning
View on GitHub
ASR, End-to-End, end2end, Speech Recognition, 端到端语音识别
☆12Oct 25, 2020Updated 5 years ago
tomastokar / Additive-Margin-Softmax
View on GitHub
Pytorch implementation of additive margin softmax loss
☆12Aug 5, 2021Updated 4 years ago
xinjli / asr2k
View on GitHub
asr2k
☆51Jun 2, 2024Updated 2 years ago
Kirili4ik / QuartzNet-ASR-pytorch
View on GitHub
Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. In PyTroch with CTC loss and beam search.
☆16Nov 5, 2020Updated 5 years ago
hitz-zentroa / whisper-lm
View on GitHub
Add n-gram and large language model (LLM) support to Whisper models.
☆43May 6, 2025Updated last year
TeaPoly / CTC-OptimizedLoss
View on GitHub
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆59Sep 6, 2023Updated 2 years ago
j-min / MoChA-pytorch
View on GitHub
PyTorch Implementation of "Monotonic Chunkwise Attention" (ICLR 2018)
☆81Apr 2, 2018Updated 8 years ago
ARM-software / keyword-transformer
View on GitHub
Official implementation of the Keyword Transformer: https://arxiv.org/abs/2104.00769
☆141Apr 29, 2022Updated 4 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
zhangcong-zc / Text_Matching
View on GitHub
Text Matching Based on LCQMC: A Large-scale Chinese Question Matching Corpus
☆17Jan 12, 2021Updated 5 years ago
Kirili4ik / kws-attention-pytorch
View on GitHub
Keyword spotting for audio with attention (KWS model for audio)
☆18Jul 15, 2021Updated 5 years ago
espnet / warp-ctc
View on GitHub
Pytorch Bindings for warp-ctc maintained by ESPnet
☆17Feb 20, 2021Updated 5 years ago
SpeechColab / Leaderboard
View on GitHub
SpeechIO Leaderboard: a large, robust, comprehensive, benchmarking platform for Automatic Speech Recognition.
☆547Mar 29, 2025Updated last year
bliunlpr / Robust_e2e_gan
View on GitHub
PyTorch implementation of "Jointly Adversarial Enhancement Training for Robust End-to-End Speech Recognition"
☆19Jul 19, 2019Updated 7 years ago
k2-fsa / k2
View on GitHub
FSA/FST algorithms, differentiable, with PyTorch compatibility.
☆1,348Jul 11, 2026Updated last week
yongzhuo / gemma-sft
View on GitHub
Gemma-SFT, gemma-2b/gemma-7b微调(finetune,transformers)/LORA(peft)/推理(inference)
☆32May 17, 2024Updated 2 years ago