ASR教程: https://dataxujing.github.io/ASR-paper/
☆26Jul 1, 2024Updated last year
Alternatives and similar repositories for ASR-paper
Users that are interested in ASR-paper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🔥 语音合成(TTS),语音克隆教程: https://dataxujing.github.io/TTS-paper/#/☆11Oct 29, 2024Updated last year
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆21Sep 25, 2023Updated 2 years ago
- 语音识别数字0-9☆13Jul 16, 2019Updated 6 years ago
- ☆13Mar 30, 2023Updated 3 years ago
- Korean speech recognition based on transformer (트랜스포머 기반 한국어 음성 인식)☆31Feb 19, 2021Updated 5 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 基于GMM的0-9孤立词语音识别系统☆10Sep 29, 2020Updated 5 years ago
- ☆15Aug 30, 2022Updated 3 years ago
- 语音识别 论文 前沿☆53Jan 8, 2022Updated 4 years ago
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆25Oct 11, 2024Updated last year
- Causal streaming adaptation of OpenAI Whisper for real-time transcription on small audio chunks.☆71Mar 31, 2026Updated 3 weeks ago
- 中文语音识别,automatic speech recognition(ASR)☆14Dec 30, 2021Updated 4 years ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- faster inference☆28Jan 20, 2025Updated last year
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Mar 1, 2021Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- 分享在深蓝学院《语音识别:从入门到精通》第一期课程学习过程中完成的课后作业,供参考。☆22Sep 13, 2020Updated 5 years ago
- ☆32Oct 28, 2022Updated 3 years ago
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain☆11Sep 4, 2023Updated 2 years ago
- A scalable solution that simplifies the integration of ComfyUI for developers☆11Jul 15, 2024Updated last year
- An adaptive comb filtering algorithm for the enhancement of harmonic signals in the presence of additive white noise. The algorithm impro…☆14Jan 10, 2023Updated 3 years ago
- 基于GMM与MFCC特征进行数字0-9的语音识别,GMM,MFCC,语音识别,中文数据,sklearn,Digital Voice Recognition。☆18Jun 21, 2022Updated 3 years ago
- Official pytorch implementation of Cross Modality Knowledge Distillation between A-mode Ultrasound and Surface Electromyography.☆14May 23, 2023Updated 2 years ago
- Implementation for WatchYourMouth: Silent Speech Recognition with Depth Sensing presented at CHI 2024☆19Oct 6, 2025Updated 6 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- 中文语音识别☆25May 25, 2022Updated 3 years ago
- A Pytorch (support batch and channel) implementation of GoogleBrain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech…☆12Jul 24, 2024Updated last year
- ☆13May 23, 2024Updated last year
- A reproducible 3D convolutional neural network with dual attention module (3D-DAM) for Alzheimer's disease classification☆19Nov 5, 2024Updated last year
- Welcome to my project. OpenPyVision is a real time videoMixer based on opencv and pyqt6.☆14Aug 22, 2024Updated last year
- 基于深度学习的普通话语音识别☆18Apr 23, 2019Updated 7 years ago
- state-of-the-art models for diacritics restoration for Arabic language☆17Feb 23, 2025Updated last year
- Octopus is a neural machine generation toolkit for Arabic Natural Lnagauge Generation (NLG)☆10Apr 29, 2024Updated 2 years ago
- Transcribe desktop audio/computer audio in real-time and locally (Streaming ASR), using TorchAudio and Emformer-RNNT model for inference,…☆14May 7, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 深蓝学院语音课程《语音识别从入门到精通》课程作业☆22Apr 2, 2020Updated 6 years ago
- 主要参考李宏毅老师2020年人类语言处理课程资料整理,包括代码和ppt☆34May 25, 2021Updated 4 years ago
- NLP 自然语言处理教程 https://dataxujing.github.io/NLP-paper/☆30Sep 17, 2021Updated 4 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆26Jul 16, 2021Updated 4 years ago
- Vietnamese Punctuation Prediction using Pretrained Language Models☆14May 8, 2022Updated 3 years ago
- 深度学习实战项目(图像识别、语音识别 、文本处理等)☆17Aug 2, 2019Updated 6 years ago
- Estonian text-to-speech text normalization pipeline☆12Dec 17, 2025Updated 4 months ago