DakeQQ / Automatic-Speech-Recognition-ASR-ONNXView external linksLinks
Utilizes ONNX Runtime to transcribe audio into text.
☆81Updated this week
Alternatives and similar repositories for Automatic-Speech-Recognition-ASR-ONNX
Users that are interested in Automatic-Speech-Recognition-ASR-ONNX are comparing it to the libraries listed below
Sorting:
- Utilizes ONNX Runtime for speech activity detection.☆41Dec 10, 2025Updated 2 months ago
- Utilizes ONNX Runtime for audio denoising.☆115Dec 27, 2025Updated last month
- Transcribe subtitles and translate them offline with ease.☆40Jan 10, 2026Updated last month
- Export the STFT or ISTFT process in ONNX format.☆40Nov 21, 2025Updated 2 months ago
- 修复funasr中seaco-paraformer导出onnx后没有时间戳的bug☆24Sep 12, 2024Updated last year
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 2 years ago
- Running the F5-TTS by ONNX Runtime standalone with GUI☆24Dec 10, 2024Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆109Aug 16, 2024Updated last year
- Utilizes ONNX Runtime for TTS model.☆49Updated this week
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆29Aug 31, 2025Updated 5 months ago
- Demonstration of combine YOLO and depth estimation on Android device.☆67Nov 15, 2025Updated 3 months ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- ☆25Jan 5, 2026Updated last month
- OpenAI Whisper demo on Axera☆14Jan 15, 2026Updated last month
- ☆10Sep 2, 2024Updated last year
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆26Mar 13, 2025Updated 11 months ago
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Openfst mirror with some fixes☆14Aug 23, 2024Updated last year
- Official PyTorch implementation of (ICME2025 oral) "AutoStyle-TTS: Retrieval-Augmented Generation based Automatic Style Matching Text-to-…☆17Feb 1, 2026Updated 2 weeks ago
- Transfer learning approach to pronunciation scoring☆11Jan 17, 2024Updated 2 years ago
- Demonstrate Yolov9 model with Qualcomm Hexagon NPU and DirectML☆12Nov 27, 2024Updated last year
- 使用OpenCV部署图像描述Image_Captioning,包含C++和Python两个版本的程序☆12Dec 22, 2023Updated 2 years ago
- Using OpenVINO to speed up MeloTTS inference☆15Nov 1, 2024Updated last year
- DST is a Decoder-only simultaneous machine translation model, which can conduct policy decision and translation concurrently☆11Jun 6, 2024Updated last year
- Tracking beer/wine using Audio Event Detection with Machine Learning☆15Jun 16, 2024Updated last year
- The Official PyTorch Implementation of "Mel-McNet: A Mel-Scale Framework for Online Multichannel Speech Enhancement" [Interspeech 2025]☆21Jun 9, 2025Updated 8 months ago
- StyleTTS2 + Vocos as a Decoder☆13Mar 24, 2025Updated 10 months ago
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆13Apr 7, 2021Updated 4 years ago
- Running the F5-TTS by ONNX Runtime☆191Jan 7, 2026Updated last month
- Python runtime for WeTextProcessing (does not depend on Pynini)☆48Nov 28, 2025Updated 2 months ago
- IPA Phonemizer/Dephonemizer for 140 human languages☆54Updated this week
- Code release for "TinySpeech: Attention Condensers for Deep Speech Recognition Neural Networks on Edge Devices"☆21Jun 7, 2025Updated 8 months ago
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- Clean and modernized implementation of FastSpeech2/LightSpeech using IPA☆18Aug 16, 2024Updated last year
- A Study of Low-Resource Speech Commands Recognition Based on Adversarial Reprogramming☆19Oct 12, 2023Updated 2 years ago
- ☆14Aug 19, 2024Updated last year
- Demonstration of running a native LLM on Android device.☆226Updated this week
- (WIP) A retrain of F5-TTS on permissively-licensed data☆13Apr 6, 2025Updated 10 months ago