Speech recognition API service powered by FunASR and Qwen-ASR, supporting 52 languages, compatible with OpenAI API and Alibaba Cloud Speech API. 基于 FunASR 与 Qwen3-ASR 的语音识别 API 服务,支持 52 种语言,兼容 OpenAI API 与阿里云语音 API。
☆229Mar 31, 2026Updated 2 weeks ago
Alternatives and similar repositories for funasr-api
Users that are interested in funasr-api are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fun-ASR-Nano-2512官方发布的仓库内容有点多,部署起来坑也比较多,本项目提供一个简化的部署方案。☆131Dec 26, 2025Updated 3 months ago
- A simple implementation for improving CosyVoice2 by GRPO method☆37Oct 17, 2025Updated 5 months ago
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆88Jan 14, 2026Updated 3 months ago
- 一个基于 Sherpa-ONNX 的高性能语音识别服务,支持实时VAD(语音活动检测)、多语言语音识别和声纹识别功能。☆93Jan 4, 2026Updated 3 months ago
- 自用。一个用PHP,基于redis的简单易用的异步任务处理demo☆15Aug 28, 2016Updated 9 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- 基于Fastrtc、Ollama、FunASR和MegaTTS的大模型中文语音实时对话应用☆21Apr 26, 2025Updated 11 months ago
- 基于FastAPI的语音服务系统,集成语音合成(TTS)和语音 识别(STT)功能。使用CosyVoice2作为TTS引擎,FunASR作为STT引擎,支持零样本语音克隆、流式输出、多种语言识别等高级功能。☆20Apr 1, 2025Updated last year
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆22Feb 12, 2026Updated 2 months ago
- Testing sets for semanticVAD☆20Feb 18, 2025Updated last year
- The baselines of ARC-Challenge-Interspeech2026☆57Dec 1, 2025Updated 4 months ago
- ☆38Apr 3, 2025Updated last year
- ☆19May 4, 2025Updated 11 months ago
- Android Frida GUI Manager; Android 图形化Frida管理器☆22Nov 19, 2021Updated 4 years ago
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 阿里SenseVoice的fastpi封装,采用onnx发布,体积更小,附带量化模型,支持GPU。支持从URL文件进行语音识别。☆109Sep 2, 2024Updated last year
- livekit agent plugins☆41Feb 19, 2026Updated last month
- 开源 webhook 代理服务,基于 Hono 框架和 Cloudflare Workers 构建。将 webhook 事件实时转换为 WebSocket 或 SSE 事件流。☆16Nov 15, 2025Updated 4 months ago
- Bridging QQ and Telegram with NapCat & mtcute. 基于 NapCat 和 mtcute 的 QQ-Telegram 消息桥☆40Updated this week
- Fun-CosyVoice3-0.5B-2512 语音合成服务的简化部署方案,以及快速测试和部署提供应用调用☆77Dec 24, 2025Updated 3 months ago
- 基于gradio的极简 ragflow API 聊天Web界面☆18Mar 31, 2025Updated last year
- ☆50Nov 26, 2023Updated 2 years ago
- PASE: Phonologically Anchored Speech Enhancer☆46Updated this week
- ☆22Oct 22, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- 🧩 / 🌐 OpenAI Plugins Directory - collection that support Lobe Chat☆21Feb 5, 2026Updated 2 months ago
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- 基于Python Selenium的Flask API,用于使用Python脚本操控腾讯AI元宝网页的输入上传发送等操作并检测元宝AI的输出将其返回为json,以创建一个腾讯元宝驱动的AI API接口☆31Dec 27, 2025Updated 3 months ago
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆50Feb 17, 2026Updated last month
- A lightweight Telegram bot that bridges Claude Code to any local folder, with autostart support for remote mobile development.☆118Mar 31, 2026Updated 2 weeks ago
- Minerva是一个便捷的音频工具,支持快速进行录音(PCM/MP3/WAV)和VAD端点检测识别,并保存活动语音。☆10May 23, 2024Updated last year
- ☆14Sep 9, 2020Updated 5 years ago
- 使用LLM大模型、langchain、fastapi、agent等技术实现ai和用户聊天,并且支持本地向量库、api接口工具,支持http sse流式输出☆18Apr 11, 2024Updated 2 years ago
- YoloV8 segmentation NPU for the RK 3566/68/88☆18Apr 30, 2024Updated last year
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- astrbot的GsCore插件适配器☆36Mar 18, 2026Updated 3 weeks ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆17Jun 27, 2025Updated 9 months ago
- ☆39Sep 25, 2025Updated 6 months ago
- MiMo-Audio: Audio Language Models are Few-Shot Learners☆1,011Mar 3, 2026Updated last month
- 使用open-webui中的pipelines技术在open-webui中调用ragflow的agent实现基于知识库的智能对话,并拥有美观的界面。☆162Oct 31, 2025Updated 5 months ago
- 基于modelscope(魔搭社区)阿里大模型的语音转文本工具☆10Feb 2, 2024Updated 2 years ago
- yolov5_obb C++ onnxruntime deployment☆10Mar 11, 2024Updated 2 years ago