Docs for GGUF quantization (unofficial)
☆400Jul 19, 2025Updated 8 months ago
Alternatives and similar repositories for gguf-docs
Users that are interested in gguf-docs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆25Sep 1, 2025Updated 6 months ago
- Writing Tools, Apple's AI-inspired app, enchants Windows, enhancing your pen with AI LLMs. One hotkey press, system-wide, fixes grammar, …☆27Jul 26, 2025Updated 7 months ago
- Neural Audio Codecs implemented in C# - DAC, SNAC, Encodec, Dia☆45Jun 11, 2025Updated 9 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- Mirror from gitlab☆11Jan 9, 2021Updated 5 years ago
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.☆162Jul 5, 2025Updated 8 months ago
- Enhancing LLMs with LoRA☆211Oct 20, 2025Updated 5 months ago
- A simple, easy-to-customize pipeline for local RAG evaluation. Starter prompts and metric definitions included.☆26Jan 14, 2026Updated 2 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆50Oct 29, 2025Updated 4 months ago
- A minimal CLI tool for piping anything into an LLM.☆20Jan 1, 2026Updated 2 months ago
- For converting LLM datasets from one format into another.☆22Nov 12, 2025Updated 4 months ago
- A lightweight API that returns Nvidia GPU utilisation information.☆16Sep 23, 2025Updated 6 months ago
- A tool for humans to interact with a Chroma vector database☆16Mar 2, 2025Updated last year
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆35Jan 18, 2026Updated 2 months ago
- Stable Diffusion in pure C/C++☆16Feb 27, 2026Updated 3 weeks ago
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input your VRAM and RAM and the toolcha…☆82Updated this week
- ☆219Oct 30, 2025Updated 4 months ago
- Metal GPU implementation of the Qwen3 transformer model on macOS with complete Apple Silicon compute shader acceleration.☆43Oct 6, 2025Updated 5 months ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆30Apr 30, 2025Updated 10 months ago
- ☆40Feb 25, 2026Updated last month
- llama.cpp fork with additional SOTA quants and improved performance☆22Updated this week
- JS implementations of JNI libraries for CheerpJ☆14May 13, 2024Updated last year
- An extension for oobabooga/text-generation-webui that automatically unloads and reloads your model.☆17Apr 22, 2024Updated last year
- Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc☆2,868Updated this week
- ☆10Nov 3, 2025Updated 4 months ago
- JotItNow is a AI Voice Notes App☆25Mar 6, 2025Updated last year
- Minimal web client for chatting and roleplay with AI characters☆26Aug 21, 2025Updated 7 months ago
- A cross platform App that gives you the best UX to run models locally or remotely on your own hardware☆75Mar 15, 2026Updated last week
- Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling☆498Mar 18, 2026Updated last week
- A universal adapter including zero-copy Python bindings for Philip Turner's metal flash attention library.☆24Dec 15, 2025Updated 3 months ago
- ☆17Jun 22, 2024Updated last year
- A Rust-based, SenseVoiceSmall☆27Mar 9, 2026Updated 2 weeks ago
- A daemon that automatically manages the performance states of NVIDIA GPUs.☆116Feb 24, 2026Updated last month
- ☆29Jul 10, 2025Updated 8 months ago
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- Docker/podman container for llama.cpp/vllm/exllamav{2,3} orchestrated using llama-swap☆18Updated this week
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆29Dec 29, 2025Updated 2 months ago
- ☆43Aug 2, 2025Updated 7 months ago