adriancable / qwen3.c

Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.

☆83

Alternatives and similar repositories for qwen3.c

Users who are interested in qwen3.c are comparing it to the repositories listed below.

jd-3d / SOLOBench
☆131Updated 2 months ago
TheProxyCompany / proxy-structuring-engine
Guaranteed Structured Output from any Language Model via Hierarchical State Machines
☆140Updated last month
theroyallab / YALS
☆79Updated this week
inferx-net / inferx
InferX is an Inference Function-as-a-Service platform
☆115Updated 2 weeks ago
NimbleEdge / sparse_transformers
Sparse inference for transformer-based LLMs
☆193Updated this week
perk11 / large-model-proxy
Run multiple resource-heavy Large Models (LM) on the same machine with limited amount of VRAM/other resources by exposing them on differe…
☆67Updated 2 weeks ago
chigkim / Ollama-MMLU-Pro
☆95Updated 6 months ago
leafspark / AutoGGUF
Automatically quantize GGUF models
☆185Updated last week
mzbac / mlx_sharding
Distributed inference for MLX LLMs
☆93Updated 11 months ago
LostRuins / datasetexplorer
Easily view and modify JSON datasets for large language models
☆77Updated last month
av / klmbr
klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs
☆78Updated 9 months ago
matt-c1 / llama-3-quant-comparison
Comparison of the output quality of quantization methods, using Llama 3, transformers, GGUF, EXL2.
☆156Updated last year
rafacelente / bllama
1.58-bit LLaMa model
☆81Updated last year
nath1295 / MLX-Textgen
A Python package for serving LLMs on OpenAI-compatible API endpoints with prompt caching using MLX.
☆89Updated 2 weeks ago
TC-Zheng / ActuosusAI
AI management tool
☆118Updated 8 months ago
avarayr / suaveui
Open source LLM UI, compatible with all local LLM providers.
☆175Updated 9 months ago
sam-paech / antislop-sampler
☆307Updated 3 months ago
abgulati / hf-waitress
Serving LLMs in the HF-Transformers format via a PyFlask API
☆71Updated 10 months ago
matatonic / openedai-vision
An OpenAI API-compatible API for chatting with image input and asking questions about the images (multimodal).
☆257Updated 4 months ago
matteoserva / GraphLLM
☆204Updated last month
epolewski / EricLLM
A fast batching API to serve LLM models
☆183Updated last year
willccbb / mlx_parallm
Fast parallel LLM inference for MLX
☆198Updated last year
thomasgauthier / LoRD
Low-Rank adapter extraction for fine-tuned transformers models
☆173Updated last year
KartDriver / mira_converse
☆80Updated 4 months ago
SlerpE / highCompute.py
☆28Updated last month
severian42 / Cascade-of-Semantically-Integrated-Layers
CaSIL is an advanced natural language processing system that implements a sophisticated four-layer semantic analysis architecture. It pro…
☆66Updated 8 months ago
rombodawg / Easy_training
☆49Updated 4 months ago
turboderp-org / exllamav3
An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs
☆430Updated this week
remichu-ai / gallama
☆131Updated 2 months ago
tdrussell / qlora-pipe
A pipeline-parallel training script for LLMs.
☆152Updated 2 months ago