QwenLM / Qwen3-ASR-Toolkit
Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support.
☆713Updated last month
Alternatives and similar repositories for Qwen3-ASR-Toolkit
Users who are interested in Qwen3-ASR-Toolkit are comparing it to the repositories listed below
- ☆530Updated 2 months ago
- A powerful 3B-parameter, LLM-based reinforcement learning audio editing model that excels at editing emotion, speaking style, and paralinguistics…☆744Updated last week
- MiMo-Audio: Audio Language Models are Few-Shot Learners☆871Updated 2 months ago
- ☆642Updated last month
- An open-source implementation of Whisper☆468Updated last month
- ☆472Updated 6 months ago
- ☆635Updated last month
- ☆1,047Updated last month
- Official MiniMax Model Context Protocol (MCP) server that enables interaction with powerful Text to Speech, image generation and video ge…☆1,108Updated 3 weeks ago
- A real-time Electron-based desktop GUI for DeepSeek-OCR☆696Updated last month
- Google's NotebookLM, but local☆651Updated 2 months ago
- Open-source framework for developing real-time multimodal conversational AI agents.☆543Updated last week
- TTS model capable of streaming conversational audio in realtime.☆874Updated last week
- ☆476Updated 7 months ago
- Model-agnostic plug-n-play LangChain/LangGraph agents powered entirely by MCP tools over HTTP/SSE.☆769Updated last month
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆297Updated 6 months ago
- A quick vibe-coded app for DeepSeek-OCR☆1,511Updated 3 weeks ago
- Generate Web Pages and Components with text prompts, with Local Models. (or Cloud Models, if you want) - now supports Thinking Models!☆397Updated 5 months ago
- Build, enrich, and transform datasets using AI models with no code☆1,591Updated last month
- Omnilingual ASR: Open-Source Multilingual Speech Recognition for 1600+ Languages☆2,424Updated 3 weeks ago
- Lightning-Fast, On-Device TTS — running natively via ONNX.☆1,756Updated this week
- Learn to build and deploy local Visual Language Models for Edge AI☆329Updated last month
- GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters☆382Updated this week
- Liquid Audio - Speech-to-Speech audio models by Liquid AI☆285Updated 2 months ago
- ☆171Updated 3 months ago
- Open source workflow automation platform built for developers - full observability and code exportability!☆669Updated this week
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning☆2,651Updated this week
- VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)☆778Updated last week
- ☆787Updated last month
- A minimal yet professional single agent demo project that showcases the core execution pipeline and production-grade features of agents.☆786Updated last week