DataXujing/ASR-paper

DataXujing / ASR-paper

ASR tutorial: https://dataxujing.github.io/ASR-paper/

☆26

Alternatives and similar repositories for ASR-paper

Users who are interested in ASR-paper are comparing it to the libraries listed below.

DataXujing / TTS-paper
🔥 Speech synthesis (TTS) and voice cloning tutorial: https://dataxujing.github.io/TTS-paper/#/
☆11 · Oct 29, 2024 · Updated last year
idiap / contextual-biasing-on-gpus
Implementation of contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…
☆21 · Sep 25, 2023 · Updated 2 years ago
dobby-seo / kosr
Transformer-based Korean speech recognition
☆31 · Feb 19, 2021 · Updated 5 years ago
Mddct / WeUSM
☆13 · Mar 30, 2023 · Updated 3 years ago
AIBigTruth / 0-9-speech-recognition-system-based-on-GMM
GMM-based isolated-word speech recognition system for the digits 0-9
☆10 · Sep 29, 2020 · Updated 5 years ago
TowerYsable / ASR_awesome
Cutting-edge speech recognition papers
☆53 · Jan 8, 2022 · Updated 4 years ago
du-ud / kaldi-cslt
☆15 · Aug 30, 2022 · Updated 3 years ago
TeaPoly / CE-OptimizedLoss
Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…
☆25 · Oct 11, 2024 · Updated last year
xingxingRealzyx / rv1126_bsp
Rv1126 application construction framework, AI algorithm demos, etc.
☆10 · Oct 9, 2021 · Updated 4 years ago
glynpu / asr_abc
Chinese speech recognition; automatic speech recognition (ASR)
☆14 · Dec 30, 2021 · Updated 4 years ago
SELMA-project / ml4audio
audio, NLP, ML with huggingface, nvidia/nemo, speechbrain
☆11 · Sep 4, 2023 · Updated 2 years ago
Zeng-Jia / CMKD-MINDS
Official PyTorch implementation of Cross Modality Knowledge Distillation between A-mode Ultrasound and Surface Electromyography.
☆14 · May 23, 2023 · Updated 3 years ago
Mddct / simple-tts
(WIP) Long-form speech generation
☆30 · Apr 2, 2025 · Updated last year
zzpDapeng / Transformer-Transducer
A streamable speech recognition model with transformer encoders and RNN-T loss
☆11 · Mar 1, 2021 · Updated 5 years ago
Mddct / cosyvoice2-flow-optimized
Faster inference
☆27 · Jan 20, 2025 · Updated last year
jinsongpan / ASR_Course_Homework
Homework completed during the first session of the Shenlan Academy (深蓝学院) course "Speech Recognition: From Beginner to Expert", shared for reference.
☆21 · Sep 13, 2020 · Updated 5 years ago
snsun / kaldi-decoder-code-reading
☆33 · Oct 28, 2022 · Updated 3 years ago
luanshiyinyang / DeepLearningProject
Hands-on deep learning projects (image recognition, speech recognition, text processing, etc.)
☆17 · Aug 2, 2019 · Updated 6 years ago
AlirezaMorsali / MLP-Attention
☆17 · Dec 19, 2024 · Updated last year
zhengyima / GMM_Digital_Voice_Recognition
Speech recognition of the digits 0-9 based on GMM and MFCC features. Keywords: GMM, MFCC, speech recognition, Chinese data, sklearn, Digital Voice Recognition.
☆18 · Jun 21, 2022 · Updated 4 years ago
IMLHF / SpecAugmentPyTorch
A PyTorch implementation (with batch and channel support) of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech…
☆11 · Jul 24, 2024 · Updated last year
giaminhgist / 3D-DAM
A reproducible 3D convolutional neural network with dual attention module (3D-DAM) for Alzheimer's disease classification
☆18 · Nov 5, 2024 · Updated last year
oleges1 / quartznet-pytorch
QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261]
☆27 · Jul 16, 2021 · Updated 5 years ago
Jordain / Comfy_Image_Workshop
A scalable solution that simplifies the integration of ComfyUI for developers
☆11 · Jul 15, 2024 · Updated 2 years ago
MenglingD / mandarin_speech_recognition
Deep-learning-based Mandarin speech recognition
☆18 · Apr 23, 2019 · Updated 7 years ago
tomer9080 / WhisperRT-Streaming
Causal streaming adaptation of OpenAI Whisper for real-time transcription on small audio chunks.
☆75 · Mar 31, 2026 · Updated 3 months ago
George0828Zhang / torch_cif
A fast parallel PyTorch implementation of the "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition" https://arxiv.org/ab…
☆37 · Feb 10, 2024 · Updated 2 years ago
CelineChen95 / Shenlan-ASR-Course
Coursework for the Shenlan Academy (深蓝学院) speech course "Speech Recognition: From Beginner to Expert"
☆22 · Apr 2, 2020 · Updated 6 years ago
YounesAbounaceur / Conferences_Recommender_System
For all the researchers who, after waiting for a long time, get their research papers rejected by big international conferences. This…
☆10 · Dec 6, 2020 · Updated 5 years ago
Intersection98 / ComfyUI_MX_post_processing-nodes
☆13 · May 23, 2024 · Updated 2 years ago
AlessioMichelassi / openPyVision_013
Welcome to my project. OpenPyVision is a real-time video mixer based on OpenCV and PyQt6.
☆14 · Aug 22, 2024 · Updated last year
TartuNLP / tts_preprocess_et
Estonian text-to-speech text normalization pipeline
☆14 · Dec 17, 2025 · Updated 7 months ago
wolfparticle / lee-nlp_asr2020
Notes compiled mainly from Prof. Hung-yi Lee's 2020 Human Language Processing course materials, including code and slides
☆34 · May 25, 2021 · Updated 5 years ago
DataXujing / NLP-paper
NLP (natural language processing) tutorial: https://dataxujing.github.io/NLP-paper/
☆30 · Sep 17, 2021 · Updated 4 years ago
WillBrennan / MotionDetector
A motion detector for video, written with OpenCV
☆12 · Nov 3, 2022 · Updated 3 years ago
AssemblyAI / kaldi-asr-tutorial
Repo for hosting tutorial code associated with the Kaldi Speech Recognition for Beginners - A Simple Tutorial blog by AssemblyAI
☆13 · May 20, 2023 · Updated 3 years ago
1ytic / warp-rna
Recurrent Neural Aligner
☆51 · Apr 14, 2020 · Updated 6 years ago
UBC-NLP / octopus
Octopus is a neural machine generation toolkit for Arabic Natural Language Generation (NLG)
☆10 · Apr 29, 2024 · Updated 2 years ago
AIAnytime / PaliGemma-Inference-and-Fine-Tuning
PaliGemma Inference and Fine Tuning
☆13 · May 15, 2024 · Updated 2 years ago