Utilizes ONNX Runtime to transcribe audio into text.
☆84May 14, 2026Updated 2 weeks ago
Alternatives and similar repositories for Automatic-Speech-Recognition-ASR-ONNX
Users that are interested in Automatic-Speech-Recognition-ASR-ONNX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Utilizes ONNX Runtime for speech activity detection.☆44Dec 10, 2025Updated 5 months ago
- Transcribe subtitles and translate them offline with ease.☆45Jan 10, 2026Updated 4 months ago
- Utilizes ONNX Runtime for audio denoising.☆125Dec 27, 2025Updated 5 months ago
- Export the STFT or ISTFT process in ONNX format.☆43Mar 16, 2026Updated 2 months ago
- 修复funasr中seaco-paraformer导出onnx后没有时间戳的bug☆25Sep 12, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Demonstration of combine YOLO and depth estimation on Android device.☆71Nov 15, 2025Updated 6 months ago
- OpenAI Whisper demo on Axera☆15Jan 15, 2026Updated 4 months ago
- Running the F5-TTS by ONNX Runtime standalone with GUI☆25Dec 10, 2024Updated last year
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 3 years ago
- ☆43Mar 18, 2026Updated 2 months ago
- A lightweight demo of FunASR-Nano using ONNX runtime.☆78Feb 25, 2026Updated 3 months ago
- Running the F5-TTS by ONNX Runtime☆197May 15, 2026Updated 2 weeks ago
- 使用OpenCV部署图像描述Image_Captioning,包含C++和Python两个版本的程序☆12Dec 22, 2023Updated 2 years ago
- Demonstration of running a native LLM on Android device.☆252Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated 2 years ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆115Aug 16, 2024Updated last year
- Unofficial Implementation of Latent Diffusion Models for Layout-to-image Generation☆12Nov 10, 2022Updated 3 years ago
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆13Apr 7, 2021Updated 5 years ago
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆35Apr 14, 2026Updated last month
- Transfer learning approach to pronunciation scoring☆12Jan 17, 2024Updated 2 years ago
- ☆24Jul 17, 2024Updated last year
- 该项目来源于阿里开源的语音降噪模型zipEnhancer☆38May 8, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Crawling and creating a German language model resource☆18Aug 23, 2022Updated 3 years ago
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆743May 14, 2026Updated 2 weeks ago
- Example of SenseCraft Model Assistant Model deployment related to ESP32☆33Apr 9, 2025Updated last year
- Python runtime for WeTextProcessing (does not depend on Pynini)☆51Nov 28, 2025Updated 6 months ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated 4 months ago
- low-latency realtime ASR based on FireRedASR☆61Jul 8, 2025Updated 10 months ago
- Pseudo Streaming SenseVoice with Hotwords☆449Mar 13, 2025Updated last year
- ☆33Aug 6, 2021Updated 4 years ago
- Qwen3-TTS with nano vLLM-style optimizations for fast text-to-speech generation. Achieved 3x faster☆123Mar 3, 2026Updated 2 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- IPA Phonemizer/Dephonemizer for 140 human languages☆58May 6, 2026Updated 3 weeks ago
- Port of Funasr's Paraformer model in C/C++☆43Jun 19, 2024Updated last year
- ☆42Apr 29, 2026Updated last month
- 基于InternLm chat 7B大模型基座,构建一个Agent ,可以调用 MMYOLO 工具来完成图像内视觉任务☆11Oct 30, 2024Updated last year
- Fast CosyVoice3 inference with tensorRT and tensorRT-LLM☆73Mar 7, 2026Updated 2 months ago
- ☆22Jul 29, 2024Updated last year
- ☆15Apr 16, 2026Updated last month