ASR: Chinese speech recognition
☆35 Jul 30, 2019 Updated 6 years ago
Alternatives and similar repositories for SpeechRecognition
Users that are interested in SpeechRecognition are comparing it to the libraries listed below.
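- Chinese speech recognition project implementing a Transformer encoder-decoder architecture in TensorFlow ☆34 Feb 24, 2021 Updated 5 years ago
- Speech recognition implemented with Python and TensorFlow ☆47 Oct 30, 2018 Updated 7 years ago
- Speech recognition of the digits 0-9 based on GMM and MFCC features. GMM, MFCC, speech recognition, Chinese data, sklearn, Digital Voice Recognition. ☆18 Jun 21, 2022 Updated 3 years ago
- Deep-learning-based Mandarin speech recognition ☆18 Apr 23, 2019 Updated 7 years ago
- End-to-end Chinese speech recognition ☆93 Jul 25, 2024 Updated last year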
- BERT for few_shot_learning, provided siamese net, match net, prototypical net ☆12 Jan 6, 2020 Updated 6 years ago
- Chinese speech recognition series; readers can use it to quickly train their own Chinese speech recognition models, or test the results directly with the pretrained models. ☆282 Mar 23, 2021 Updated 5 years ago
- seq_2_seq text generation based on transformers ☆22 Feb 18, 2021 Updated 5 years ago
- AI-synthesized voices of Genshin Impact characters; the results are a bit funny (pfft~) ☆12 Jan 9, 2023 Updated 3 years ago
- Phoneme recognition using various features and deep learning. ☆13 Nov 27, 2019 Updated 6 years ago
- Speech recognition of the digits 0-9 ☆14 Jul 16, 2019 Updated 6 years ago
- Web app created to collect audios for course project ☆10 Apr 6, 2018 Updated 8 years ago
- Calculate MFCC/Fbank feature for wav files ☆15 Nov 21, 2017 Updated 8 years ago
- ☆21 Apr 12, 2025 Updated last year
- GMM-based isolated-word speech recognition system for the digits 0-9 ☆10 Sep 29, 2020 Updated 5 years ago
- Building an NN-CTC acoustic model with phoneme-level modeling ☆15 May 14, 2019 Updated 7 years ago
- Using deep neural nets to write books ☆13 Dec 9, 2016 Updated 9 years ago
- A Simple Automatic Speech Recognition (ASR) Model in Tensorflow, which only needs to focus on Deep Neural Network. It's easy to test popu… ☆19 Jan 18, 2018 Updated 8 years ago
- Automatic Speech Recognition with TensorFlow (CNN+BLSTM+CTC) ☆12 Aug 9, 2018 Updated 7 years ago
- End-to-End Automatic Speech Recognition on PyTorch ☆304 Jun 2, 2022 Updated 4 years ago
- Tacotron2 with BERT examples ☆10 Jul 8, 2019 Updated 6 years ago
- Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using… ☆14 Jul 2, 2020 Updated 5 years ago
- A packaged convolutional voice activity detector for noisy environments. ☆14 Jun 15, 2019 Updated 7 years ago
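- Digital image processing project. Implements the basic Touge (头歌) digital image processing functions, plus 9 kinds of style transfer and OCR recognition of text on ID cards ☆11 Jul 2, 2023 Updated 2 years ago
- ASR tutorial: https://dataxujing.github.io/ASR-paper/ ☆26 Jul 1, 2024 Updated last year
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining ☆12 Mar 23, 2021 Updated 5 years ago
- Coursework for the Shenlan Academy (深蓝学院) speech course "Speech Recognition: From Beginner to Master" ☆22 Apr 2, 2020 Updated 6 years ago
- Speech recognition theory, papers, and slides (PPT) ☆618 Aug 7, 2024 Updated last year
- [ACL 2025 Main] A Comprehensive Mandarin Speech Dataset for Young Children Aged 3-5 ☆55 Mar 19, 2025 Updated last year
- Chinese speech recognition; Mandarin Automatic Speech Recognition ☆1,969 Jul 25, 2024 Updated last year
- Learning character embeddings generated by Chinese pretrained models; testing how BERT and ELMo perform on Chinese ☆100 Jan 22, 2020 Updated 6 years ago
- Chit-chat model implemented with GPT2 ☆12 Apr 22, 2021 Updated 5 years ago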
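- Conformer encoder + Transformer decoder with Hybrid CTC/attention ☆12 Nov 11, 2021 Updated 4 years ago
- [Interspeech22] Improving Mispronunciation Detection with Wav2vec2-based Momentum Pseudo-Labeling for Accentedness and Intelligibility Ass… ☆38 Jan 23, 2024 Updated 2 years ago
- Data analysis and processing practice (including: # basic data preprocessing operations; # implementations of basic machine learning algorithms.) ☆17 Aug 23, 2018 Updated 7 years ago
- Undergraduate thesis project: design and implementation of a deep-learning-based enhancement system for blurry face images ☆10 Jan 12, 2018 Updated 8 years ago
- LSTM- and BiLSTM-based language model on the Chinese Wikipedia dataset ☆16 Mar 8, 2019 Updated 7 years ago
- 安坐sity: sitting posture correction based on visual recognition ☆13 Apr 26, 2021 Updated 5 years ago
- Practice exercises in speech emotion recognition and speaker recognition, built on the CASIA database dataset ☆74 Dec 17, 2022 Updated 3 years ago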