All in one Qwen3-ASR Server, compatible with OpenAI API
☆284May 12, 2026Updated last week
Alternatives and similar repositories for qwen3-asr
Users that are interested in qwen3-asr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Fun-ASR-Nano-2512官方发布的仓库内容有点多,部署起来坑也比较多,本项目提供一个简化的部署方案。☆144Dec 26, 2025Updated 5 months ago
- A simple implementation for improving CosyVoice2 by GRPO method☆38May 5, 2026Updated 3 weeks ago
- ASR教程: https://dataxujing.github.io/ASR-paper/☆26Jul 1, 2024Updated last year
- stt websockect server using sherpa-onnx☆53Feb 28, 2026Updated 2 months ago
- Fun-ASR is an end-to-end speech recognition large model launched by Tongyi Lab.☆97Jan 14, 2026Updated 4 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- 基于Fastrtc、Ollama、FunASR和MegaTTS的大模型中文语音实时对话应用☆22Apr 26, 2025Updated last year
- 这是基于FunASR实现的区分说话人语音识别API | This is a speaker-diarization-based speech recognition API implemented using FunASR.☆26Feb 12, 2026Updated 3 months ago
- Testing sets for semanticVAD☆20Feb 18, 2025Updated last year
- Generative Motion Latent Flow Matching for Audio-driven Talking Portrait☆33Sep 10, 2025Updated 8 months ago
- The baselines of ARC-Challenge-Interspeech2026☆59Dec 1, 2025Updated 5 months ago
- ☆39Apr 3, 2025Updated last year
- funasr语音转文字的简单api版本,funasr+fastapi,方便部署在服务器上☆13Aug 10, 2024Updated last year
- 阿里SenseVoice的fastpi封装,采用onnx发布,体积更小,附带量化模型,支持GPU。支持从URL文件进行语音识别。☆109Sep 2, 2024Updated last year
- A modern web UI for the Qwen ASR model, featuring audio recording, PWA support, Picture-in-Picture mode, and local caching for fast, accu…☆262Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official PyTorch inference code for the Interspeech 2025 paper: Efficient Speech Enhancement via Embeddings from Pre-trained Generative A…☆79Jun 16, 2025Updated 11 months ago
- pure rust implemented drawer library( api like canvas), enables AI models that do not produce raw images to generate images☆53Updated this week
- 用 onnx 和 gguf 格式混合运行 Fun-ASR-Nano 模型全流程☆138May 5, 2026Updated 3 weeks ago
- Bridging QQ and Telegram with NapCat & mtcute. 基于 NapCat 和 mtcute 的 QQ-Telegram 消息桥☆41Updated this week
- 基于gradio的极简 ragflow API 聊天Web界面☆18Mar 31, 2025Updated last year
- [ECCV 2024] SegVG: Transferring Object Bounding Box to Segmentation for Visual Grounding☆63Oct 22, 2024Updated last year
- ☆49Nov 26, 2023Updated 2 years ago
- 开源 webhook 代理服务,基于 Hono 框架和 Cloudflare Workers 构建。将 webhook 事件实时转换为 WebSocket 或 SSE 事件流。☆17Nov 15, 2025Updated 6 months ago
- 提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手☆33Apr 9, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- VocalVerse: A powerful vocal evaluation framework powered by the Qwen LLMs☆46May 11, 2026Updated 2 weeks ago
- Access any internal service from your browser. No VPN, no client, one command. Shield CLI is a browser-first internal service gateway — S…☆31Apr 24, 2026Updated last month
- A freeswitch esl server for make a callcenter core,ex:ACD,IVR and so on......☆12Sep 26, 2016Updated 9 years ago
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆13Oct 22, 2024Updated last year
- 基于Python Selenium的Flask API,用于使用Python脚本操控腾讯AI元宝网页的输入上传发送等操作并检测元宝AI的输出将其返回为json,以创建一个腾讯元宝驱动的AI API接口☆32Dec 27, 2025Updated 4 months ago
- 🧩 / 🌐 OpenAI Plugins Directory - collection that support Lobe Chat☆24Feb 5, 2026Updated 3 months ago
- FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens…☆49Feb 17, 2026Updated 3 months ago
- 是APEX贡献的一个基于大数据平台能力的数据开发平台,帮助企业以最小成本实现链接数据,构建和沉淀数仓模型,降低数据应用门槛,沉淀数据价值。☆12Oct 31, 2024Updated last year
- Minerva是一个便捷的音频工具,支持快速进行录音(PCM/MP3/WAV)和VAD端点检测识别,并保存活动语音。☆10May 23, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [Lab] lab website☆11May 18, 2026Updated last week
- ☆15Sep 9, 2020Updated 5 years ago
- The official implement of Freeze-Omni.☆15Jul 10, 2025Updated 10 months ago
- 基本算法与数据结构的Java实现☆25Nov 22, 2021Updated 4 years ago
- FreeSWITCH ASR module fork from mod_audio_stream, use FunASR online cpu version☆18Jun 27, 2025Updated 10 months ago
- MiMo-Audio: Audio Language Models are Few-Shot Learners☆1,042Mar 3, 2026Updated 2 months ago
- [ACL 2026 Main] MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆140Sep 2, 2025Updated 8 months ago