Speech recognition API service powered by FunASR and Qwen-ASR, supporting 52 languages, compatible with OpenAI API and Alibaba Cloud Speech API. 基于 FunASR 与 Qwen3-ASR 的语音识别 API 服务,支持 52 种语言,兼容 OpenAI API 与阿里云语音 API。
☆191Mar 19, 2026Updated this week
Alternatives and similar repositories for funasr-api
Users that are interested in funasr-api are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fun-ASR-Nano-2512官方发布的仓库内容有点多,部署起来坑也比较多,本项目提供一个简化的部署方案。☆124Dec 26, 2025Updated 2 months ago
- A simple implementation for improving CosyVoice2 by GRPO method☆35Oct 17, 2025Updated 5 months ago
- ASR教程: https://dataxujing.github.io/ASR-paper/☆25Jul 1, 2024Updated last year
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆80Jan 14, 2026Updated 2 months ago
- 一个基于 Sherpa-ONNX 的高性能语音识别服务,支持实时VAD(语音活动检测)、多语言语音识别和声纹识别功能。☆84Jan 4, 2026Updated 2 months ago
- stt websockect server using sherpa-onnx☆50Feb 28, 2026Updated 3 weeks ago
- 自用。一个用PHP,基于redis的简单易用的异步任务处理demo☆15Aug 28, 2016Updated 9 years ago
- 基于Fastrtc、Ollama、FunASR和MegaTTS的大模型中文语音实时对话应用☆21Apr 26, 2025Updated 10 months ago
- 基于FastAPI的语音服务系统,集成语音合成(TTS) 和语音识别(STT)功能。使用CosyVoice2作为TTS引擎,FunASR作为STT引擎,支持零样本语音克隆、流式输出、多种语言识别等高级功能。☆20Apr 1, 2025Updated 11 months ago
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆23Feb 12, 2026Updated last month
- The baselines of ARC-Challenge-Interspeech2026☆57Dec 1, 2025Updated 3 months ago
- ☆38Apr 3, 2025Updated 11 months ago
- Code for "AtTGen: Attribute Tree Generation for Real-World Attribute Joint Extraction", ACL 2023☆13May 19, 2023Updated 2 years ago
- A simple AI/ML tool for non-technical creatives☆11May 5, 2023Updated 2 years ago
- Bridging QQ and Telegram with NapCat & mtcute. 基于 NapCat 和 mtcute 的 QQ-Telegram 消息桥☆34Updated this week
- An SSH plugin for Dify☆13Jan 16, 2026Updated 2 months ago
- 用 onnx 和 gguf 格式混合运行 Fun-ASR-Nano 模型全流程☆94Updated this week
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- 阿里SenseVoice的fastpi封装,采用onnx发布,体积更小,附带量化模型,支持GPU。支持从URL文件进行语音识别。☆108Sep 2, 2024Updated last year
- livekit agent plugins☆40Feb 19, 2026Updated last month
- Fun-CosyVoice3-0.5B-2512 语音合成服务的简化部署方案,以及快速测试和部署提供应用调用☆72Dec 24, 2025Updated 3 months ago
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆76Jun 16, 2025Updated 9 months ago
- 开源 webhook 代理服务,基于 Hono 框架和 Cloudflare Workers 构建。将 webhook 事件实时转换为 WebSocket 或 SSE 事件流。☆17Nov 15, 2025Updated 4 months ago
- ☆50Nov 26, 2023Updated 2 years ago
- Fast Search for Gzipped Log Files☆10Apr 26, 2025Updated 10 months ago
- PASE: Phonologically Anchored Speech Enhancer☆44Dec 10, 2025Updated 3 months ago
- ☆22Oct 22, 2024Updated last year
- Mini Swoole for TCP, UDP, HTTP, Websocket framework based on Swoole☆10Mar 26, 2020Updated 5 years ago
- Multi-speaker separation, identification, diarization ALL-IN-ONE. It can isolate the target speaker from a conversation audio and do ASR.☆65Oct 13, 2025Updated 5 months ago
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- [NeurIPS 2021] Open Rule Induction☆20May 22, 2022Updated 3 years ago
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆50Feb 17, 2026Updated last month
- 是APEX贡献的一个基于大数据平台能力的数据开发平台,帮助企业以最小成本实现链接数据,构建和沉淀数仓模型,降低数据应用门槛,沉淀数据价值。☆12Oct 31, 2024Updated last year
- 国内版抖音批量下载☆10Feb 6, 2023Updated 3 years ago
- Minerva是一个便捷的音频工具,支持快速进行录音(PCM/MP3/WAV)和VAD端点检测识别,并保存活动语音。☆10May 23, 2024Updated last year
- ☆14Sep 9, 2020Updated 5 years ago
- 使用LLM大模型、langchain、fastapi、agent等技术实现ai和用户聊天,并且支持本地向量库、api接口工具,支持http sse流式输出☆18Apr 11, 2024Updated last year
- YoloV8 segmentation NPU for the RK 3566/68/88☆17Apr 30, 2024Updated last year
- astrbot的GsCore插件适配器☆34Mar 3, 2026Updated 3 weeks ago