ASR教程: https://dataxujing.github.io/ASR-paper/
☆25Jul 1, 2024Updated last year
Alternatives and similar repositories for ASR-paper
Users that are interested in ASR-paper are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- 🔥 语音合成(TTS),语音克隆教程: https://dataxujing.github.io/TTS-paper/#/☆11Oct 29, 2024Updated last year
- ☆13Aug 14, 2023Updated 2 years ago
- Implementation of the contextual biasing for ASR decoding on GPUs without lattice generation. The code supports submission to Interspeech…☆21Sep 25, 2023Updated 2 years ago
- 语音识别数字0-9☆13Jul 16, 2019Updated 6 years ago
- ☆13Mar 30, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 基于GMM的0-9孤立词语 音识别系统☆10Sep 29, 2020Updated 5 years ago
- ☆15Aug 30, 2022Updated 3 years ago
- 语音识别 论文 前沿☆52Jan 8, 2022Updated 4 years ago
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆25Oct 11, 2024Updated last year
- 中文语音识别,automatic speech recognition(ASR)☆14Dec 30, 2021Updated 4 years ago
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated last year
- faster inference☆28Jan 20, 2025Updated last year
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Mar 1, 2021Updated 5 years ago
- This is the home directory to speaker diarization module being developed for Hetergeneous News data in RedHen Labs as a GSOC Project☆10Sep 11, 2015Updated 10 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- ☆32Oct 28, 2022Updated 3 years ago
- audio, NLP, ML with huggingface, nvidia/nemo, speechbrain☆11Sep 4, 2023Updated 2 years ago
- A scalable solution that simplifies the integration of ComfyUI for developers☆11Jul 15, 2024Updated last year
- An adaptive comb filtering algorithm for the enhancement of harmonic signals in the presence of additive white noise. The algorithm impro…☆14Jan 10, 2023Updated 3 years ago
- 基于GMM与MFCC特征进行数字0-9的语音识别,GMM,MFCC,语音识别,中文数据,sklearn,Digital Voice Recognition。☆19Jun 21, 2022Updated 3 years ago
- Official pytorch implementation of Cross Modality Knowledge Distillation between A-mode Ultrasound and Surface Electromyography.☆14May 23, 2023Updated 2 years ago
- Implementation for WatchYourMouth: Silent Speech Recognition with Depth Sensing presented at CHI 2024☆19Oct 6, 2025Updated 6 months ago
- 中文语音识别☆25May 25, 2022Updated 3 years ago
- ☆17Dec 19, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆13May 23, 2024Updated last year
- A reproducible 3D convolutional neural network with dual attention module (3D-DAM) for Alzheimer's disease classification☆19Nov 5, 2024Updated last year
- Welcome to my project. OpenPyVision is a real time videoMixer based on opencv and pyqt6.☆14Aug 22, 2024Updated last year
- 基于深度学习的普通话语音识别☆18Apr 23, 2019Updated 6 years ago
- state-of-the-art models for diacritics restoration for Arabic language☆17Feb 23, 2025Updated last year
- Transcribe desktop audio/computer audio in real-time and locally (Streaming ASR), using TorchAudio and Emformer-RNNT model for inference,…☆13May 7, 2024Updated last year
- 深蓝学院语音课程《语音识别从入门到精通》课程作业☆22Apr 2, 2020Updated 6 years ago
- 主要参考李宏毅老师2020年人类语言处理课程资料整理,包括代码和ppt☆34May 25, 2021Updated 4 years ago
- Quartznet implementation on pytorch [https://arxiv.org/abs/1910.10261]☆26Jul 16, 2021Updated 4 years ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- Vietnamese Punctuation Prediction using Pretrained Language Models☆14May 8, 2022Updated 3 years ago
- 深度学习实战项目(图像识别、语音识别、文本处理等)☆17Aug 2, 2019Updated 6 years ago
- Estonian text-to-speech text normalization pipeline☆12Dec 17, 2025Updated 3 months ago
- Speech recognition API service powered by FunASR and Qwen-ASR, supporting 52 languages, compatible with OpenAI API and Alibaba Cloud Spee…☆229Mar 31, 2026Updated last week
- Recurrent Neural Aligner☆51Apr 14, 2020Updated 5 years ago
- Fast Punctuation Restoration using Transformer Models for Vietnamese☆11Jun 10, 2022Updated 3 years ago
- Repo for hosting tutorial code associated with the Kaldi Speech Recognition for Beginners - A Simple Tutorial blog by AssemblyAI☆13May 20, 2023Updated 2 years ago