An educational Rust project for exporting and running inference on the Qwen3 LLM family
☆40Aug 3, 2025Updated 7 months ago
Alternatives and similar repositories for qwen3-rs
Users who are interested in qwen3-rs compare it to the libraries listed below
- ☆42Aug 2, 2025Updated 7 months ago
- An MCP stdio toolpack for local LLMs☆22Oct 6, 2025Updated 5 months ago
- A miniaturized version of the Kimi-K2 model optimized for deployment on single H100 GPUs.☆36Jul 16, 2025Updated 7 months ago
- Protocol for Augmented Memory of Project Artifacts (MCP compatible) - extended☆24Jan 24, 2026Updated last month
- LLamaHTML is a simple HTML file for communicating with a running llama.cpp llama-server☆22Aug 5, 2025Updated 7 months ago
- Multi-agent orchestration framework for AI applications - build, deploy, and manage AI agents across the full lifecycle with Forge, Conve…☆30Jan 10, 2026Updated last month
- ☆15Feb 1, 2025Updated last year
- A curated collection of persona-based MCP server & tool groupings.☆36Sep 11, 2025Updated 5 months ago
- Load and run Llama from safetensors files in C☆15Oct 24, 2024Updated last year
- Convert PyTorch models to ncnn☆13Dec 5, 2018Updated 7 years ago
- These agents work based on any local model. You ask your question and simply indicate the number of agents and experts who will answer it…☆19Feb 25, 2024Updated 2 years ago
- ☆64Jun 24, 2025Updated 8 months ago
- Provider protocol, provider SDK and 1st party providers☆17Jul 10, 2024Updated last year
- Fully local, temporally aware natural language file search on your PC, even without a GPU. Find relevant files using natural language i…☆172Dec 15, 2025Updated 2 months ago
- Cross-Platform High-Level LLM Library☆52Updated this week
- ☆46Feb 19, 2026Updated 2 weeks ago
- LLM inference in C/C++☆23Oct 4, 2024Updated last year
- ☆24Jan 22, 2025Updated last year
- A sleek web interface for Ollama, making local LLM management and usage simple. WebOllama provides an intuitive UI to manage Ollama model…☆71Oct 8, 2025Updated 5 months ago
- An Open-Source Modular AI Assistant☆32Mar 20, 2025Updated 11 months ago
- ☆35Mar 22, 2025Updated 11 months ago
- Thin wrapper around GGML to make life easier☆42Nov 5, 2025Updated 4 months ago
- Text-to-Speech (TTS) engine for the Armenian language☆12Sep 29, 2024Updated last year
- Natural language control for Python CLI tools using locally-trained SLMs (CPU inference)☆30Feb 21, 2026Updated 2 weeks ago
- A desktop GUI for Flux 1.1 Pro built using DelphiFMX For Python☆11Oct 5, 2024Updated last year
- A Chrome extension that enables virtual fashion try-on and model swap using FASHN AI. Hover over fashion images on any website to: (1) tr…☆21Aug 14, 2025Updated 6 months ago
- High-Performance Text Deduplication Toolkit☆62Aug 25, 2025Updated 6 months ago
- Orchestrator Kit for Agentic Reasoning - OrKa is a modular AI orchestration system that transforms Large Language Models (LLMs) into comp…☆90Feb 25, 2026Updated last week
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆40Aug 2, 2023Updated 2 years ago
- A converter and basic tester for rwkv onnx☆43Jan 29, 2024Updated 2 years ago
- Deep Hermes, but it decides how to respond on its OWN, with no need for system prompts.☆40Apr 1, 2025Updated 11 months ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Feb 14, 2026Updated 3 weeks ago
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.☆10Dec 3, 2024Updated last year
- ☆11Jan 7, 2023Updated 3 years ago
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 4 months ago
- Kotlin library for Cortex.cpp, a local AI API platform used to run and customize LLMs.☆10Apr 2, 2025Updated 11 months ago
- A simple CUDA- or CPU-powered library for creating vector embeddings using Candle and models from Hugging Face☆47May 3, 2024Updated last year
- LlamaNet: Decentralized Inference Swarm for llama.cpp☆23Jan 18, 2026Updated last month
- Demonstration of Single Sign On with an OpenId provider.☆12Oct 18, 2020Updated 5 years ago