callbacked / qwen3-mcp
An MCP-enabled Qwen3 0.6B demo with adjustable thinking budget, all in your browser!
☆25Updated 4 months ago
Alternatives and similar repositories for qwen3-mcp
Users that are interested in qwen3-mcp are comparing it to the libraries listed below
- A Multi-Agentic AI Assistant/Builder☆24Updated last month
- The heart of The Pulsar App: fast, secure, and shared inference with a modern UI☆57Updated 10 months ago
- ☆24Updated 8 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆46Updated last month
- 🎮 Material You TUI for monitoring NVIDIA GPUs☆57Updated 4 months ago
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Updated 8 months ago
- A fully autonomous agent that accesses the browser and performs tasks.☆16Updated 5 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31Updated 5 months ago
- A real-time shared memory layer for multi-agent LLM systems.☆48Updated 3 months ago
- ☆18Updated 3 months ago
- *NIX SHELL with Local AI/LLM integration☆23Updated 7 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆16Updated 5 months ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Updated last year
- ☆62Updated 3 months ago
- Make Qwen3 Think like Gemini 2.5 Pro | Open webui function☆23Updated 5 months ago
- LexiCrawler is a powerful Go-based web crawling API meticulously designed to extract, clean, and transform web page content into a pristi…☆48Updated 7 months ago
- Complex RAG backend☆29Updated last year
- ☆13Updated 6 months ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13Updated last year
- ☆22Updated 8 months ago
- RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for eng…☆81Updated 3 months ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆81Updated last week
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆51Updated 5 months ago
- A Python package for serving LLMs on OpenAI-compatible API endpoints with prompt caching using MLX.☆96Updated 3 months ago
- A 100% private web application that converts speech to speech☆76Updated 4 months ago
- ☆48Updated 7 months ago
- ☆60Updated 3 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- run ollama & gguf easily with a single command☆52Updated last year
- ☆14Updated 8 months ago