reinterpretcat / qwen3-rsLinks
An educational Rust project for exporting and running inference on Qwen3 LLM family
☆29Updated 2 months ago
Alternatives and similar repositories for qwen3-rs
Users that are interested in qwen3-rs are comparing it to the libraries listed below
Sorting:
- Super-simple, fully Rust powered "memory" (doc store + semantic search) for LLM projects, semantic search, etc.☆62Updated 2 years ago
- AI Assistant☆20Updated 5 months ago
- Light WebUI for lm.rs☆24Updated 11 months ago
- A Pure Rust based LLM (Any LLM based MLLM such as Spark-TTS) Inference Engine, powering by Candle framework.☆167Updated 2 weeks ago
- A simple, CUDA or CPU powered, library for creating vector embeddings using Candle and models from Hugging Face☆41Updated last year
- Official Rust Implementation of Model2Vec☆138Updated last week
- A python package for serving LLM on OpenAI-compatible API endpoints with prompt caching using MLX.☆96Updated 3 months ago
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆18Updated last month
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.☆130Updated 3 months ago
- Built for demanding AI workflows, this gateway offers low-latency, provider-agnostic access, ensuring your AI applications run smoothly a…☆78Updated 4 months ago
- *NIX SHELL with Local AI/LLM integration☆23Updated 7 months ago
- Rust implementation of Surya☆60Updated 7 months ago
- Yet another `llama.cpp` Rust wrapper☆12Updated last year
- git-like rag pipeline☆244Updated 2 weeks ago
- Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…☆80Updated last week
- A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust☆79Updated last year
- The Easiest Rust Interface for Local LLMs and an Interface for Deterministic Signals from Probabilistic LLM Vibes☆237Updated 2 months ago
- Wraps any OpenAI API interface as Responses with MCPs support so it supports Codex. Adding any missing stateful features. Ollama and Vllm…☆110Updated 3 months ago
- A relatively basic implementation of RWKV in Rust written by someone with very little math and ML knowledge. Supports 32, 8 and 4 bit eva…☆94Updated 2 years ago
- This repository has code for fine-tuning LLMs with GRPO specifically for Rust Programming using cargo as feedback☆107Updated 7 months ago
- ollama like cli tool for MLX models on huggingface (pull, rm, list, show, serve etc.)☆104Updated last week
- fast state-of-the-art speech models and a runtime that runs anywhere 💥☆57Updated 4 months ago
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Updated 8 months ago
- OpenAI compatible API for serving LLAMA-2 model☆218Updated last year
- ☆23Updated 8 months ago
- A Fish Speech implementation in Rust, with Candle.rs☆98Updated 4 months ago
- AirLLM 70B inference with single 4GB GPU☆14Updated 3 months ago
- ☆13Updated last month
- Implementing the BitNet model in Rust☆39Updated last year
- A simple, "Ollama-like" tool for managing and running GGUF language models from your terminal.☆21Updated last month