Quantatirsk/funasr-api

Speech recognition API service powered by FunASR and Qwen3-ASR, supporting 52 languages and compatible with the OpenAI API and the Alibaba Cloud Speech API.

☆229

Alternatives and similar repositories for funasr-api

Users that are interested in funasr-api are comparing it to the libraries listed below.

fengin / Fun-ASR-Nano-2512-Deploy
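The official Fun-ASR-Nano-2512 repository contains a lot of material and has quite a few deployment pitfalls; this project provides a simplified deployment approach.
☆131 · Dec 26, 2025 · Updated 3 months ago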
ryuclc / CosyVoice2-GRPO
A simple implementation for improving CosyVoice2 by GRPO method
☆37 · Oct 17, 2025 · Updated 5 months ago
yuekaizhang / Fun-ASR-vllm
Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.
☆88 · Jan 14, 2026 · Updated 3 months ago
bbeyondllove / asr_server
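A high-performance speech recognition service based on Sherpa-ONNX, supporting real-time VAD (voice activity detection), multilingual speech recognition, and voiceprint recognition.
☆93 · Jan 4, 2026 · Updated 3 months ago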
liukelin / php_task_queue
For personal use. A simple, easy-to-use asynchronous task processing demo written in PHP and based on Redis.
☆15 · Aug 28, 2016 · Updated 9 years ago
jkin8010 / fastrtc-talking-more
A real-time Chinese voice conversation application for large language models, built on Fastrtc, Ollama, FunASR, and MegaTTS.
☆21 · Apr 26, 2025 · Updated 11 months ago
yanghan-cyber / audio-service
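A FastAPI-based speech service system integrating text-to-speech (TTS) and speech-to-text (STT). It uses CosyVoice2 as the TTS engine and FunASR as the STT engine, and supports advanced features such as zero-shot voice cloning, streaming output, and multilingual recognition.
☆20 · Apr 1, 2025 · Updated last year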
lukeewin / FunASR_API
This is a speaker-diarization-based speech recognition API implemented using FunASR.
☆22 · Feb 12, 2026 · Updated 2 months ago
semanticVAD / testsets
Testing sets for semanticVAD
☆20 · Feb 18, 2025 · Updated last year
Audio-Reasoning-Challenge / Audio-Reasoning-Challenge-Baselines
The baselines of ARC-Challenge-Interspeech2026
☆57 · Dec 1, 2025 · Updated 4 months ago
yuriak / SpeechDialogueFactory
☆38 · Apr 3, 2025 · Updated last year
kevintsai1202 / difyplugin
☆19 · May 4, 2025 · Updated 11 months ago
icespite / FridaHooker
Android Frida GUI Manager
☆22 · Nov 19, 2021 · Updated 4 years ago
jundaychan / funasr-fastapi
A simple API version of FunASR speech-to-text (FunASR + FastAPI), convenient to deploy on a server.
☆13 · Aug 10, 2024 · Updated last year
HG-ha / SenseVoice-Api
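A FastAPI wrapper for Alibaba's SenseVoice, released as ONNX for a smaller footprint, with a quantized model included and GPU support. Supports speech recognition from files given by URL.
☆109 · Sep 2, 2024 · Updated last year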
di-osc / livekit-plugins-chinese
livekit agent plugins
☆41 · Feb 19, 2026 · Updated last month
lc-cn / webhook-proxy
An open-source webhook proxy service built on the Hono framework and Cloudflare Workers. It converts webhook events into WebSocket or SSE event streams in real time.
☆16 · Nov 15, 2025 · Updated 4 months ago
NapGram / NapGram
Bridging QQ and Telegram with NapCat & mtcute.
☆40 · Updated this week
fengin / Fun-CosyVoice3-0.5B-2512-Deploy
A simplified deployment approach for the Fun-CosyVoice3-0.5B-2512 speech synthesis service, with quick testing and deployment so that applications can call it.
☆77 · Dec 24, 2025 · Updated 3 months ago
zstar1003 / Simple-Ragflow
A minimal Gradio-based chat web interface for the RAGFlow API.
☆18 · Mar 31, 2025 · Updated last year
HaujetZhao / FunASR-Online-Paraformer-Test
☆50 · Nov 26, 2023 · Updated 2 years ago
cisco-open / pase
PASE: Phonologically Anchored Speech Enhancer
☆46 · Updated this week
mundane799699 / AI-TODO
☆22 · Oct 22, 2024 · Updated last year
lobehub / lobe-openai-plugins
🧩 / 🌐 OpenAI Plugins Directory - collection that support Lobe Chat
☆21 · Feb 5, 2026 · Updated 2 months ago
PINTO0309 / onnx-aec
A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.
☆13 · Oct 22, 2024 · Updated last year
TW-SkyHope / TxYuanbao-To-PyAPI
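A Flask API based on Python Selenium that uses Python scripts to drive input, upload, send, and other operations on the Tencent Yuanbao AI web page, detects Yuanbao's output, and returns it as JSON, creating an AI API backed by Tencent Yuanbao.
☆31 · Dec 27, 2025 · Updated 3 months ago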
jingzhunxue / FlowMirror_HydraVox
FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…
☆50 · Feb 17, 2026 · Updated last month
terranc / claude-telegram-bot-bridge
A lightweight Telegram bot that bridges Claude Code to any local folder, with autostart support for remote mobile development.
☆118 · Mar 31, 2026 · Updated 2 weeks ago
LZ9 / Minerva
Minerva is a convenient audio tool that supports quick recording (PCM/MP3/WAV) and VAD endpoint detection, and saves the active speech.
☆10 · May 23, 2024 · Updated last year
yqli2420 / noisex-92
☆14 · Sep 9, 2020 · Updated 5 years ago
gzlliyu / chatStreamAiAgent
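Implements chat between an AI and users using LLMs, LangChain, FastAPI, agents, and related technologies, with support for a local vector store, API tools, and HTTP SSE streaming output.
☆18 · Apr 11, 2024 · Updated 2 years ago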
Qengineering / YoloV8-seg-NPU
YoloV8 segmentation NPU for the RK 3566/68/88
☆18 · Apr 30, 2024 · Updated last year
KimigaiiWuyi / astrbot_plugin_gscore_adapter
GsCore plugin adapter for AstrBot.
☆36 · Mar 18, 2026 · Updated 3 weeks ago
zhoutuan / mod_funasr
FreeSWITCH ASR module forked from mod_audio_stream, using the FunASR online CPU version.
☆17 · Jun 27, 2025 · Updated 9 months ago
xkx-hub / KALL-E
☆39 · Sep 25, 2025 · Updated 6 months ago
XiaomiMiMo / MiMo-Audio
MiMo-Audio: Audio Language Models are Few-Shot Learners
☆1,011 · Mar 3, 2026 · Updated last month
luyilong2015 / open-webui-pipeline-for-ragflow
Uses the pipelines feature of open-webui to call RAGFlow agents from within open-webui, enabling knowledge-base-driven intelligent conversation with a polished interface.
☆162 · Oct 31, 2025 · Updated 5 months ago
ByronLeeeee / SimpleSpeechTranscription
A speech-to-text tool based on Alibaba's large models from ModelScope.
☆10 · Feb 2, 2024 · Updated 2 years ago
chuzhixing / yolov5_obb_onnxruntime_deploy
yolov5_obb C++ onnxruntime deployment
☆10 · Mar 11, 2024 · Updated 2 years ago