reinterpretcat / qwen3-rs
An educational Rust project for exporting and running inference on the Qwen3 LLM family
☆40Aug 3, 2025Updated 6 months ago
Alternatives and similar repositories for qwen3-rs
Users who are interested in qwen3-rs are comparing it to the libraries listed below.
- ☆42Aug 2, 2025Updated 6 months ago
- A miniaturized version of the Kimi-K2 model optimized for deployment on single H100 GPUs.☆36Jul 16, 2025Updated 7 months ago
- Protocol for Augmented Memory of Project Artifacts (MCP compatible) - extended☆24Jan 24, 2026Updated 3 weeks ago
- Multi-agent orchestration framework for AI applications - build, deploy, and manage AI agents across the full lifecycle with Forge, Conve…☆30Jan 10, 2026Updated last month
- ☆15Feb 1, 2025Updated last year
- ☆36Updated this week
- A curated collection of persona-based mcp server & tool groupings.☆34Sep 11, 2025Updated 5 months ago
- convert pytorch model to ncnn☆13Dec 5, 2018Updated 7 years ago
- These agents work based on any local model. You ask your question and simply indicate the number of agents and experts who will answer it…☆19Feb 25, 2024Updated last year
- ☆64Jun 24, 2025Updated 7 months ago
- fully local, temporally aware natural language file search on your pc! even without a GPU. find relevant files using natural language i…☆170Dec 15, 2025Updated 2 months ago
- An Open-Source Modular AI Assistant☆32Mar 20, 2025Updated 10 months ago
- ☆34Mar 22, 2025Updated 10 months ago
- Thin wrapper around GGML to make life easier☆42Nov 5, 2025Updated 3 months ago
- Text-to-Speech (TTS) engine for the Armenian language☆12Sep 29, 2024Updated last year
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Jun 12, 2024Updated last year
- A desktop GUI for Flux 1.1 Pro built using DelphiFMX For Python☆11Oct 5, 2024Updated last year
- High-Performance Text Deduplication Toolkit☆61Aug 25, 2025Updated 5 months ago
- Inference Llama 2 in one file of zero-dependency, zero-unsafe Rust☆40Aug 2, 2023Updated 2 years ago
- Orchestrator Kit for Agentic Reasoning - OrKa is a modular AI orchestration system that transforms Large Language Models (LLMs) into comp…☆88Jan 10, 2026Updated last month
- A converter and basic tester for rwkv onnx☆43Jan 29, 2024Updated 2 years ago
- Inference RWKV v5, v6 and v7 with Qualcomm AI Engine Direct SDK☆90Feb 5, 2026Updated last week
- MLX Implementation of Recursive Reasoning with Tiny Networks☆78Oct 11, 2025Updated 4 months ago
- ☆15Aug 5, 2025Updated 6 months ago
- LlamaNet: Decentralized Inference Swarm for llama.cpp☆23Jan 18, 2026Updated 3 weeks ago
- PotPlayer launcher for Jellyfin Web • Optional : Clickable link to the local media folder • Optional : Jellyfin Server Automation :…☆17Jan 28, 2026Updated 2 weeks ago
- Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input your VRAM and RAM and the toolcha…☆76Updated this week
- A free and open-source GUI tool that simplifies combining multiple code files into one, with automatic labeling and support for various p…☆14Jan 3, 2025Updated last year
- RWKV is a RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best …☆10Nov 3, 2023Updated 2 years ago
- ☆11Jan 7, 2023Updated 3 years ago
- An experimental desktop client for using Claude Desktop's MCP with Novelcrafter codices.☆10Dec 3, 2024Updated last year
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated last year
- Demonstration of Single Sign On with an OpenId provider.☆12Oct 18, 2020Updated 5 years ago
- Kotlin library for Cortex.cpp a Local AI API Platform that is used to run and customize LLMs.☆10Apr 2, 2025Updated 10 months ago
- A companion application to use SiYuan note as a knowledge base with OpenAI APIs☆15Sep 26, 2025Updated 4 months ago
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆17Feb 5, 2026Updated last week
- ☆63Jul 10, 2025Updated 7 months ago
- llm_utils: Basic LLM tools, best practices, and minimal abstraction.☆48Feb 18, 2025Updated 11 months ago
- An interactive, story-based Web Monetization tutorial for online creators.☆11Mar 1, 2025Updated 11 months ago