HaujetZhao/Qwen3-ASR-GGUF

Exports the LLM part of Qwen3-ASR to GGUF and runs accelerated inference with llama.cpp, which supports Vulkan and CUDA acceleration.

☆196

Alternatives and similar repositories for Qwen3-ASR-GGUF

Users who are interested in Qwen3-ASR-GGUF are comparing it to the libraries listed below.

HaujetZhao / Fun-ASR-GGUF
Runs the full Fun-ASR-Nano model pipeline using a mix of ONNX and GGUF formats.
☆154 · May 5, 2026 · Updated 2 months ago
HaujetZhao / Qwen3-TTS-GGUF
The fastest Qwen3-TTS inference solution. Exports the LLM part of Qwen3-TTS to GGUF and runs accelerated inference with llama.cpp, which supports Vulkan and CUDA acceleration.
☆165 · Jun 11, 2026 · Updated last month
shershah1024 / qwen3-asr-llamacpp
Qwen3-ASR speech-to-text for llama.cpp: patch, GGUF models, and benchmarks
☆15 · Feb 2, 2026 · Updated 5 months ago
predict-woo / qwen3-asr.cpp
Implementation of Qwen3-ASR-0.6B in GGML
☆101 · Updated this week
Wasser1462 / FunASR-nano-onnx
A lightweight demo of FunASR-Nano using ONNX Runtime.
☆83 · Feb 25, 2026 · Updated 5 months ago
alan890104 / qwen3-asr-rs
Pure-Rust inference engine for Qwen3-ASR speech recognition models (0.6B & 1.7B) using candle with Metal/CUDA acceleration
☆25 · Mar 17, 2026 · Updated 4 months ago
yuekaizhang / Fun-ASR-vllm
Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.
☆107 · Jul 7, 2026 · Updated 2 weeks ago
HaujetZhao / SenseVoice-ONNX
SenseVoice-Small exported to ONNX, with hotword injection: path matching in the CTC output space performs hotword replacement within 1 ms.
☆28 · Jun 3, 2026 · Updated last month
miemiekurisu / qwen3asr_cpu
A high-performance C/C++ inference server for Qwen3-ASR, optimized for CPU/GPU real-time streaming speech recognition.
☆15 · Jun 27, 2026 · Updated 3 weeks ago
QwenAudio / Fun-ASR
Open-source LLM-based ASR model family for Chinese, dialect, accent, and multilingual speech, with FunASR, vLLM, streaming, and llama.cpp…
☆1,425 · Updated this week
manyeyes / ManySpeech
AI Speech Solutions for Tasks such as ASR, Vocal Extraction, Accompaniment Extraction, Audio Denoising, and Enhancement, Support models s…
☆85 · Jun 16, 2026 · Updated last month
predict-woo / qwen3-tts.cpp
☆221 · Jul 18, 2026 · Updated last week
baicai-1145 / Qwen3-ASR-onnx
☆18 · Mar 18, 2026 · Updated 4 months ago
QwenLM / Qwen3-ASR
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music…
☆3,214 · Jun 26, 2026 · Updated 3 weeks ago
xiaomi-research / dasheng-tokenizer
State-of-the-art continuous audio tokenization
☆40 · Mar 9, 2026 · Updated 4 months ago
Gilgamesh-J / X-ASR
X-ASR is a series of automatic speech recognition models based on the icefall framework, focusing on streaming ASR and low-latency deploy…
☆145 · Jul 8, 2026 · Updated 2 weeks ago
antirez / qwen-asr
C inference for Qwen3-ASR 0.6B and 1.7B transcription models
☆583 · Feb 17, 2026 · Updated 5 months ago
HaujetZhao / Chinese-ITN
Chinese inverse text normalization (Chinese ITN): converts Chinese numerals in text to Arabic numerals.
☆32 · Jun 10, 2026 · Updated last month
Wasser1462 / Qwen3-ASR-onnx
A small and simple example showing how to run Qwen3-ASR with ONNX Runtime.
☆33 · Apr 8, 2026 · Updated 3 months ago
DakeQQ / Automatic-Speech-Recognition-ASR-ONNX
Utilizes ONNX Runtime to transcribe audio into text.
☆85 · Jul 10, 2026 · Updated 2 weeks ago
yfyeung / DS-WED
[ICASSP 2026] Official code for "Measuring Prosody Diversity in Zero-Shot TTS: A New Metric, Benchmark, and Exploration"
☆17 · Apr 16, 2026 · Updated 3 months ago
Ikaros-521 / FunASR_WS
A WebSocket server adapted from the official FunASR demo, paired with FastAPI to provide an HTTP service, so real-time ASR can be tested in the browser.
☆55 · Aug 4, 2025 · Updated 11 months ago
QwenLM / Qwen3-ASR-Toolkit
Official Python toolkit for the Qwen3-ASR API. Parallel high-throughput calls, robust long-audio transcription, multi-sample-rate support…
☆981 · Feb 5, 2026 · Updated 5 months ago
leospark / FireRedVAD-Engineering
Lightweight streaming Voice Activity Detection (VAD) tool with ONNX Runtime
☆24 · Mar 18, 2026 · Updated 4 months ago
FireRedTeam / FireRedASR2S
A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/…
☆614 · Jun 2, 2026 · Updated last month
DataoceanAI / Dolphin
Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.
☆776 · Jun 11, 2026 · Updated last month
lovemefan / SenseVoice.cpp
Port of FunASR's SenseVoice model in C/C++
☆568 · Dec 19, 2025 · Updated 7 months ago
k2-fsa / sherpa-mlx
sherpa with MLX
☆15 · Aug 2, 2025 · Updated 11 months ago
goutamyg / MVT.cpp
C++ implementation of "Mobile Vision Transformer-based Visual Object Tracking" (BMVC2023) and "Separable Self and Mixed Attention Transf…
☆13 · Apr 23, 2024 · Updated 2 years ago
xinhecuican / QSmartAssistant
A modular conversational bot / smart speaker that can run fully offline with low resource usage.
☆159 · Mar 25, 2026 · Updated 4 months ago
HaujetZhao / CapsWriter-Offline
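A voice input tool for PC with offline recognition, high accuracy, and low latency, supporting hotwords and LLM polishing. Hold CapsLock or the X2 mouse side button to speak, and release to have the text entered automatically.
☆6,422 · Jun 10, 2026 · Updated last month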
CrispStrobe / CrispASR
C++ ggml runtime hub for multilingual ASR and TTS models: Cohere Transcribe, Parakeet TDT, Voxtral, Canary 1B v2, etc., plus universal for…
☆474 · Updated this week
bluryar / VoxCPM-ONNX
☆49 · Mar 18, 2026 · Updated 4 months ago
k2-fsa / sherpa-onnx-go-windows
sherpa-onnx Go package for Windows
☆14 · Jul 8, 2026 · Updated 2 weeks ago
DakeQQ / Text-to-Speech-TTS-ONNX
Utilizes ONNX Runtime to run TTS models.
☆65 · Jul 13, 2026 · Updated last week
yeahhe365 / ASR-Studio
Multi-provider ASR web studio (Qwen, Doubao, Gemini, NIM, OpenAI-compatible & more) with recording, batch queue, PWA, local cache, and be…
☆273 · Jul 16, 2026 · Updated last week
jhqxxx / aha
aha model inference library, now supports Qwen(2.5VL/3/3VL/3.5/ASR/3Embedding/3Reranker), MiniCPM(4/5), VoxCPM(0.5B/1.5/2), DeepSeek-OCR/…
☆381 · Jun 7, 2026 · Updated last month
zai-org / GLM-ASR
GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters
☆836 · Mar 6, 2026 · Updated 4 months ago
lovemefan / paraformer.cpp
Port of FunASR's Paraformer model in C/C++
☆43 · Jun 19, 2024 · Updated 2 years ago