Quantatirsk/qwen3-asr

All-in-one Qwen3-ASR server, compatible with the OpenAI API

☆320

Alternatives and similar repositories for qwen3-asr

Users who are interested in qwen3-asr are comparing it to the repositories listed below.

fengin / Fun-ASR-Nano-2512-Deploy
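The official Fun-ASR-Nano-2512 repository contains a lot of material and has quite a few deployment pitfalls; this project provides a simplified deployment approach.
☆151 · Dec 26, 2025 · Updated 6 months ago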
uaysk / qwen3-asr-openai
Qwen3-ASR server with an OpenAI-compatible API
☆42 · Mar 11, 2026 · Updated 4 months ago
JingZhaoQi / EchoSmith
☆85 · Mar 9, 2026 · Updated 4 months ago
yuekaizhang / Fun-ASR-vllm
Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.
☆107 · Jul 7, 2026 · Updated 2 weeks ago
QwenLM / Qwen3-ASR
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music…
☆3,214 · Jun 26, 2026 · Updated 3 weeks ago
zaigie / FunSpeech
An out-of-the-box speech service for local, private deployment; quickly set up Qwen3ASR/FunASR and Qwen3TTS/CosyVoice backends.
☆153 · Jul 6, 2026 · Updated 2 weeks ago
miemiekurisu / qwen3asr_cpu
A high-performance C/C++ inference server for Qwen3-ASR, optimized for CPU/GPU real-time streaming speech recognition.
☆15 · Jun 27, 2026 · Updated 3 weeks ago
QwenAudio / Fun-ASR
Open-source LLM-based ASR model family for Chinese, dialect, accent, and multilingual speech, with FunASR, vLLM, streaming, and llama.cpp…
☆1,425 · Updated this week
bbeyondllove / asr_server
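A high-performance speech recognition service based on Sherpa-ONNX, supporting real-time VAD (voice activity detection), multilingual speech recognition, and voiceprint recognition.
☆115 · Jan 4, 2026 · Updated 6 months ago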
di-osc / livekit-plugins-chinese
LiveKit agent plugins
☆47 · Apr 21, 2026 · Updated 3 months ago
lukeewin / FunASR_API
A speech recognition API with speaker diarization, implemented using FunASR.
☆27 · Jun 16, 2026 · Updated last month
xphh / fireredasr-streaming
Low-latency real-time ASR based on FireRedASR
☆62 · Jul 8, 2025 · Updated last year
HaujetZhao / asr-hotword
The best ASR post-processing hotword solution; it performs hotword replacement based on phoneme edit distance.
☆43 · Jun 10, 2026 · Updated last month
modelscope / FunASR
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,459 · Updated this week
wsjj-official / Fun-ASR-Nano-2512-Docker
Docker version of Fun-ASR-Nano-2512
☆16 · Jan 10, 2026 · Updated 6 months ago
QwenLM / Qwen3-ASR-Toolkit
Official Python toolkit for the Qwen3-ASR API. Parallel high-throughput calls, robust long-audio transcription, multi-sample-rate support…
☆981 · Feb 5, 2026 · Updated 5 months ago
FireRedTeam / FireRedASR2S
A SOTA Industrial-Grade All-in-One ASR system with ASR, VAD, LID, and Punc modules. FireRedASR2 supports Chinese (Mandarin, 20+ dialects/…
☆614 · Jun 2, 2026 · Updated last month
pengzhendong / wetext
Python runtime for WeTextProcessing (does not depend on Pynini)
☆53 · Jun 11, 2026 · Updated last month
yanlin0604 / SenseVoiceApi
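A real-time speech recognition service based on the FunASR SenseVoice model, supporting advanced features such as speaker identification, audio denoising, and ASR error correction.
☆20 · Jul 10, 2026 · Updated 2 weeks ago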
leospark / FireRedVAD-Engineering
Lightweight streaming Voice Activity Detection (VAD) tool with ONNX Runtime
☆24 · Mar 18, 2026 · Updated 4 months ago
pengzhendong / streaming-sensevoice
Pseudo Streaming SenseVoice with Hotwords
☆466 · Jun 15, 2026 · Updated last month
lgy1027 / matrix-live-diarizer
Local-first real-time meeting transcription with speaker diarization, switchable ASR engines, and optional OpenAI-compatible LLM summarie…
☆85 · Updated this week
Wasser1462 / Qwen3-ASR-onnx
A small and simple example showing how to run Qwen3-ASR with ONNX Runtime.
☆33 · Apr 8, 2026 · Updated 3 months ago
kyr0 / fast-qwen-asr-inference-vllm
FastAPI to serve Qwen-ASR with streaming support. Tested. Benchmarked. Flash Attention 2. Fast & Stable.
☆15 · Jun 24, 2026 · Updated last month
oddmeta / oddasr
An ASR API server for FunASR
☆55 · Updated this week
DataoceanAI / Dolphin
Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.
☆776 · Jun 11, 2026 · Updated last month
predict-woo / qwen3-asr.cpp
Implementation of Qwen3-ASR-0.6B in GGML
☆101 · Updated this week
Gilgamesh-J / X-ASR
X-ASR is a series of automatic speech recognition models based on the icefall framework, focusing on streaming ASR and low-latency deploy…
☆145 · Jul 8, 2026 · Updated 2 weeks ago
yfyeung / CLSP
[ACL 2026 Main] Open-Ended Speaking Style Modeling via Fine-Grained and Multi-Granular Contrastive Language-Speech Pre-training
☆104 · Apr 6, 2026 · Updated 3 months ago
DakeQQ / Automatic-Speech-Recognition-ASR-ONNX
Uses ONNX Runtime to transcribe audio into text.
☆85 · Jul 10, 2026 · Updated 2 weeks ago
Quantatirsk / paddleocr-vl-api
High-performance OCR microservice based on PaddleOCR-VL-0.9B (PaddleOCR-VL-1.5-0.9B) with MinerU-compatible API
☆37 · Jan 30, 2026 · Updated 5 months ago
zai-org / GLM-ASR
GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters
☆836 · Mar 6, 2026 · Updated 4 months ago
Wasser1462 / FunASR-nano-onnx
A lightweight demo of FunASR-Nano using ONNX Runtime.
☆83 · Feb 25, 2026 · Updated 5 months ago
k2-fsa / sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…
☆13,767 · Updated this week
pengzhendong / speaker-diarization
Offline Speaker Diarization with SenseVoice by Sherpa ONNX.
☆15 · Dec 23, 2024 · Updated last year
pengzhendong / streaming-tts-webui
Streaming Text to Speech Web UI
☆22 · May 6, 2024 · Updated 2 years ago
HaujetZhao / Qwen3-ASR-GGUF
Exports the LLM part of Qwen3-ASR to GGUF and uses llama.cpp for accelerated inference; llama.cpp supports Vulkan and CUDA acceleration.
☆196 · Apr 29, 2026 · Updated 2 months ago
FireRedTeam / FireRedASR
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR be…
☆1,940 · Feb 25, 2026 · Updated 5 months ago
modelscope / 3D-Speaker
A Repository for Single- and Multi-modal Speaker Verification, Speaker Recognition and Speaker Diarization
☆3,069 · Dec 8, 2025 · Updated 7 months ago