zw76859420/ASR_Theory

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/zw76859420/ASR_Theory)

zw76859420 / ASR_Theory

语音识别理论、论文和PPT

☆618

Alternatives and similar repositories for ASR_Theory

Users that are interested in ASR_Theory are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zw76859420 / ASR_Syllable
View on GitHub
基于卷积神经网络的语音识别声学模型的研究
☆181Jul 22, 2019Updated 7 years ago
nl8590687 / ASRT_SpeechRecognition
View on GitHub
A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统
☆8,383Apr 10, 2026Updated 3 months ago
Sundy1219 / eesen-for-thchs30
View on GitHub
ASR for Chinese Mandarin
☆76Jun 1, 2018Updated 8 years ago
kaituoxu / Speech-Transformer
View on GitHub
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆810Apr 6, 2023Updated 3 years ago
nobody132 / masr
View on GitHub
中文语音识别; Mandarin Automatic Speech Recognition;
☆1,967Jul 25, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
xxbb1234021 / speech_recognition
View on GitHub
中文语音识别
☆849May 16, 2018Updated 8 years ago
jinsongpan / ASR_Course_Homework
View on GitHub
分享在深蓝学院《语音识别：从入门到精通》第一期课程学习过程中完成的课后作业，供参考。
☆21Sep 13, 2020Updated 5 years ago
zw76859420 / ASR_WORD
View on GitHub
采用端到端方法构建声学模型，以字为建模单元，采用DCNN-CTC网络结构。
☆71Jan 26, 2019Updated 7 years ago
sailist / ASRFrame
View on GitHub
An Automatic Speech Recognition Frame ，一个中文语音识别的完整框架，提供了多个模型
☆252Jan 6, 2021Updated 5 years ago
zzw922cn / awesome-speech-recognition-speech-synthesis-papers
View on GitHub
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synth…
☆3,127Oct 19, 2023Updated 2 years ago
zw76859420 / ASR_Phone
View on GitHub
以音素建模构建NN-CTC声学模型
☆16May 14, 2019Updated 7 years ago
Alexander-H-Liu / End-to-end-ASR-Pytorch
View on GitHub
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…
☆1,210Dec 19, 2020Updated 5 years ago
wenet-e2e / wenet
View on GitHub
Production First and Production Ready End-to-End Speech Recognition Toolkit
☆5,210Jun 15, 2026Updated last month
xdcesc / my_ch_speech_recognition
View on GitHub
使用python进行语音识别
☆170Feb 16, 2022Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
wenet-e2e / speech-recognition-papers
View on GitHub
Towards hot directions in industrial end to end speech recognition
☆329Nov 30, 2021Updated 4 years ago
iamxiaoyubei / Voice-Tech-Study
View on GitHub
语音识别语音前端处理语音合成语音转换等等语音技术的资料汇总
☆23Nov 8, 2019Updated 6 years ago
YoavRamon / awesome-kaldi
View on GitHub
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
☆536Feb 9, 2022Updated 4 years ago
XiaoMi / kaldi-onnx
View on GitHub
Kaldi model converter to ONNX
☆248Jan 27, 2023Updated 3 years ago
gentaiscool / end2end-asr-pytorch
View on GitHub
End-to-End Automatic Speech Recognition on PyTorch
☆304Jun 2, 2022Updated 4 years ago
srvk / eesen
View on GitHub
The official repository of the Eesen project
☆834May 23, 2019Updated 7 years ago
hirofumi0810 / neural_sp
View on GitHub
End-to-end ASR/LM implementation with PyTorch
☆594Aug 30, 2021Updated 4 years ago
tzyll / kaldi
View on GitHub
ASR cases for speech handbook at CSLT-THU, based on Kaldi toolkit and Thchs30 database, in egs/cslt_cases.
☆107Mar 12, 2021Updated 5 years ago
linan2 / TensorFlow-speech-enhancement-Chinese
View on GitHub
基于深度学习的语音增强、去混响
☆102Jan 30, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
mindorii / kws
View on GitHub
An End-to-End Architecture for Keyword Spotting and Voice Activity Detection
☆387Mar 24, 2023Updated 3 years ago
aishell-foundation / DaCiDian
View on GitHub
DaCiDian is an open-sourced chinese mandarin lexicon for automatic speech recognition(ASR)
☆301Jun 15, 2020Updated 6 years ago
nwpuaslp / ASR_Course
View on GitHub
☆149Aug 2, 2020Updated 5 years ago
kaldi-asr / kaldi
View on GitHub
kaldi-asr/kaldi is the official location of the Kaldi project.
☆15,440Sep 22, 2025Updated 10 months ago
MenglingD / mandarin_speech_recognition
View on GitHub
基于深度学习的普通话语音识别
☆18Apr 23, 2019Updated 7 years ago
mravanelli / pytorch-kaldi
View on GitHub
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,…
☆2,401Mar 14, 2022Updated 4 years ago
espnet / espnet
View on GitHub
End-to-End Speech Processing Toolkit
☆9,904Updated this week
open-speech / speech-aligner
View on GitHub
speech-aligner，是一个从“人声语音”及其“语言文本”，产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech an…
☆410Apr 8, 2020Updated 6 years ago
Z-yq / TensorflowASR
View on GitHub
一个执着于让CPU\端侧-Model逼近GPU-Model性能的项目，CPU上的实时率(RTF)小于0.1
☆475Mar 13, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
syoyo / tacotron-tts-cpp
View on GitHub
Tacotron text to speech in C++(synthesize only)
☆77Oct 17, 2019Updated 6 years ago
ZhengkunTian / OpenTransformer
View on GitHub
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
☆378Jul 21, 2022Updated 4 years ago
kaituoxu / Listen-Attend-Spell
View on GitHub
A PyTorch implementation of Listen, Attend and Spell (LAS), an End-to-End ASR framework.
☆208Jan 8, 2019Updated 7 years ago
jx1100370217 / DFCNN-master
View on GitHub
这是一个基于全卷积神经网络的语音识别系统
☆79Jun 28, 2019Updated 7 years ago
chenkui164 / FastASR
View on GitHub
这是一个用C++实现ASR推理的项目，它依赖很少，安装也很简单，推理速度很快，在树莓派4B等ARM平台也可以流畅的运行。支持的模型是由Google的Transformer模型中优化而来，数据集是开源wenetspeech(10000+小时)或阿里私有数据集(60000+小…
☆554Mar 19, 2023Updated 3 years ago
xiangxyq / minimize-chain-decoder
View on GitHub
Minimize kaldi nnet3 chain decoder
☆45Jan 10, 2020Updated 6 years ago
Diamondfan / CTC_pytorch
View on GitHub
CTC end -to-end ASR for timit and 863 corpus.
☆219Dec 20, 2019Updated 6 years ago