Chinese speech recognition, automatic speech recognition (ASR)
☆14 Dec 30, 2021 Updated 4 years ago
Alternatives and similar repositories for asr_abc
Users that are interested in asr_abc are comparing it to the libraries listed below.
- Speech recognition of digits 0-9 ☆13 Jul 16, 2019 Updated 6 years ago
- ASR: Chinese speech recognition ☆36 Jul 30, 2019 Updated 6 years ago
- Deep-learning-based Mandarin speech recognition ☆18 Apr 23, 2019 Updated 7 years ago
- Improving beat tracking algorithms with recurrent neural networks. ☆11 Jan 7, 2019 Updated 7 years ago
- Supplementary material for the ISMIR 2020 paper: “Deconstruct, Analyse, Reconstruct: how to improve tempo, beat, and downbeat estimation”… ☆12 Mar 2, 2021 Updated 5 years ago
- GMM-based isolated-word speech recognition system for digits 0-9 ☆10 Sep 29, 2020 Updated 5 years ago
- ggml study notes; ggml is an inference framework for machine learning ☆18 Mar 24, 2024 Updated 2 years ago
- Drop-in replacement for the Bazel build system's Android repository rules to automate the downloading and installation of the Android SDK ☆11 Jun 5, 2018 Updated 7 years ago
- ☆11 May 7, 2022 Updated 4 years ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts ☆16 Dec 3, 2024 Updated last year
- Speech recognition implemented with Python and TensorFlow ☆47 Oct 30, 2018 Updated 7 years ago
- ☆11 Aug 11, 2023 Updated 2 years ago
- Keyword Search Recipe for Subword ASR ☆30 Jul 12, 2019 Updated 6 years ago
- ☆14 Aug 19, 2024 Updated last year
- Homework completed during the first run of the 深蓝学院 (Shenlan Xueyuan) course "Speech Recognition: From Beginner to Expert", shared for reference. ☆21 Sep 13, 2020 Updated 5 years ago
- Simple baseline model for the HEAR benchmark ☆23 Feb 17, 2026 Updated 3 months ago
- Chinese speech recognition ☆24 May 25, 2022 Updated 4 years ago
- ☆11 Oct 14, 2023 Updated 2 years ago
- Speech recognition using Python ☆170 Feb 16, 2022 Updated 4 years ago
- DUSTED: Spoken-Term Discovery using Discrete Speech Units ☆18 Oct 2, 2024 Updated last year
- ASR tutorial: https://dataxujing.github.io/ASR-paper/ ☆26 Jul 1, 2024 Updated last year
- ☆16 Dec 23, 2021 Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fast. ☆16 Jun 17, 2022 Updated 3 years ago
- Coursework for the 深蓝学院 (Shenlan Xueyuan) speech course "Speech Recognition: From Beginner to Expert" ☆22 Apr 2, 2020 Updated 6 years ago
- Lightweight wrapper for Silero VAD using internal ONNX Runtime and with no python package dependencies ☆17 Nov 25, 2024 Updated last year
- Hands-on deep learning projects (image recognition, speech recognition, text processing, etc.) ☆17 Aug 2, 2019 Updated 6 years ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor ☆19 Jun 5, 2023 Updated 2 years ago
- ☆21 Jul 22, 2022 Updated 3 years ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion ☆39 Sep 9, 2025 Updated 8 months ago
- NIST Language i-vector Machine Learning Challenge ☆27 Sep 15, 2016 Updated 9 years ago
- Kaldi-based small-vocabulary Chinese speech recognition, trained with a DNN ☆27 Jan 15, 2019 Updated 7 years ago
- Unet sensing image of tensorflow ☆22 May 22, 2018 Updated 8 years ago
- Note on Cohen's NS algorithm OMLSA-IMCRA, paper and code implementation ☆18 Aug 26, 2021 Updated 4 years ago
- Repository for sharing the data in the Tamasheq language, one of the target languages for the low-resource speech translation track at IW… ☆18 Nov 30, 2022 Updated 3 years ago
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025] ☆26 May 14, 2026 Updated last week
- End-to-end Text-to-Speech with Generative Adversarial Networks ☆20 Feb 6, 2021 Updated 5 years ago
- ☆25 Mar 12, 2022 Updated 4 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es… ☆19 Jun 14, 2021 Updated 4 years ago
- tacotron-2 (pytorch) + melgan (pytorch) Chinese TTS ☆26 Jul 6, 2023 Updated 2 years ago