Utilizes ONNX Runtime to transcribe audio into text.
☆82Mar 16, 2026Updated 2 weeks ago
Alternatives and similar repositories for Automatic-Speech-Recognition-ASR-ONNX
Users that are interested in Automatic-Speech-Recognition-ASR-ONNX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Utilizes ONNX Runtime for speech activity detection.☆42Dec 10, 2025Updated 3 months ago
- Transcribe subtitles and translate them offline with ease.☆40Jan 10, 2026Updated 2 months ago
- Utilizes ONNX Runtime for audio denoising.☆120Dec 27, 2025Updated 3 months ago
- Export the STFT or ISTFT process in ONNX format.☆42Mar 16, 2026Updated last week
- Utilizes ONNX Runtime for TTS model.☆50Mar 19, 2026Updated last week
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- 修复funasr中seaco-paraformer导出onnx后没有时间戳的bug☆25Sep 12, 2024Updated last year
- ☆26Mar 18, 2026Updated last week
- OpenAI Whisper demo on Axera☆14Jan 15, 2026Updated 2 months ago
- Running the F5-TTS by ONNX Runtime standalone with GUI☆24Dec 10, 2024Updated last year
- Demonstrate Yolov9 model with Qualcomm Hexagon NPU and DirectML☆12Nov 27, 2024Updated last year
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 2 years ago
- A lightweight demo of FunASR-Nano using ONNX runtime.☆65Feb 25, 2026Updated last month
- Running the F5-TTS by ONNX Runtime☆194Jan 7, 2026Updated 2 months ago
- 使用OpenCV部署图像描述Image_Captioning,包含C++和Python两个版本的程序☆12Dec 22, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++☆18Apr 17, 2024Updated last year
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆110Aug 16, 2024Updated last year
- Unofficial Implementation of Latent Diffusion Models for Layout-to-image Generation☆12Nov 10, 2022Updated 3 years ago
- 将normalize过的中文文本,做逆向normalize。具体功能即实现 chinese_text_normalization的逆向版本。☆13Apr 7, 2021Updated 4 years ago
- Official implementation of the paper "Speech Intelligibility Assessment of Dysarthric Speech by using Goodness of Pronunciation with Unce…☆27Mar 13, 2025Updated last year
- One command to start a streaming ASR server.☆12Oct 2, 2024Updated last year
- Transfer learning approach to pronunciation scoring☆12Jan 17, 2024Updated 2 years ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆31Mar 9, 2026Updated 2 weeks ago
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆702Mar 19, 2026Updated last week
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Crawling and creating a German language model resource☆18Aug 23, 2022Updated 3 years ago
- Example of SenseCraft Model Assistant Model deployment related to ESP32☆32Apr 9, 2025Updated 11 months ago
- Scripts for training Kaldi for German speech recognition (ASR).☆27Feb 11, 2021Updated 5 years ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆49Nov 28, 2025Updated 4 months ago
- poorman's ar-dit tts☆45Dec 31, 2025Updated 2 months ago
- low-latency realtime ASR based on FireRedASR☆59Jul 8, 2025Updated 8 months ago
- Port of Funasr's Paraformer model in C/C++☆40Jun 19, 2024Updated last year
- Pseudo Streaming SenseVoice with Hotwords☆442Mar 13, 2025Updated last year
- ☆33Aug 6, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- IPA Phonemizer/Dephonemizer for 140 human languages☆57Updated this week
- 基于InternLm chat 7B大模型基座,构建一个Agent ,可以调用 MMYOLO 工具来完成图像内视觉任务☆11Oct 30, 2024Updated last year
- ☆21Jul 29, 2024Updated last year
- ☆15Aug 22, 2025Updated 7 months ago
- C++ implementation of "Mobile Vision Transformer-based Visual Object Tracking" (BMVC2023) and "Separable Self and Mixed Attention Transf…☆12Apr 23, 2024Updated last year
- A Benchmark Corpus for Low-Resource Cantonese Punctuation Restoration from Speech Transcripts☆16Dec 3, 2024Updated last year
- Infrastructure useful to create natural language processing systems based on transformer networks☆12Sep 26, 2019Updated 6 years ago