Docs for GGUF quantization (unofficial)
☆464Jul 19, 2025Updated 10 months ago
Alternatives and similar repositories for gguf-docs
Users that are interested in gguf-docs are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆25Sep 1, 2025Updated 8 months ago
- Writing Tools, Apple's AI-inspired app, enchants Windows, enhancing your pen with AI LLMs. One hotkey press, system-wide, fixes grammar, …☆28Jul 26, 2025Updated 10 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- Local Qwen3 LLM inference. One easy-to-understand file of C source with no dependencies.☆176Jul 5, 2025Updated 10 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- llama.cpp fork with additional SOTA quants and improved performance☆2,554May 23, 2026Updated last week
- Enhancing LLMs with LoRA☆222Oct 20, 2025Updated 7 months ago
- A simple, easy-to-customize pipeline for local RAG evaluation. Starter prompts and metric definitions included.☆24Jan 14, 2026Updated 4 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆52Oct 29, 2025Updated 7 months ago
- An OpenVoice-based voice cloning tool, single executable file (~14M), supporting multiple formats without dependencies on ffmpeg, Python,…☆49Jan 18, 2026Updated 4 months ago
- For converting LLM datasets from one format into another.☆22Nov 12, 2025Updated 6 months ago
- A minimal CLI tool for piping anything into an LLM.☆21Jan 1, 2026Updated 4 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆60Dec 1, 2024Updated last year
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆36Jan 18, 2026Updated 4 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Stable Diffusion in pure C/C++☆16Feb 27, 2026Updated 3 months ago
- [LEGACY] LWJGL 2.X - The Lightweight Java Game Library.☆11Jan 17, 2022Updated 4 years ago
- ☆244Oct 30, 2025Updated 6 months ago
- OpenZIM MCP is a modern, secure, and high-performance MCP (Model Context Protocol) server that enables AI models to access and search ZIM…☆59May 22, 2026Updated last week
- An extension for oobabooga/text-generation-webui that automatically unloads and reloads your model.☆17Apr 22, 2024Updated 2 years ago
- JS implementations of JNI libraries for CheerpJ☆13May 13, 2024Updated 2 years ago
- ☆41Feb 25, 2026Updated 3 months ago
- Local LLM-assisted text completion for Qt Creator.☆66Apr 16, 2026Updated last month
- JotItNow is a AI Voice Notes App☆25Mar 6, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Minimal web client for chatting and roleplay with AI characters☆26Aug 21, 2025Updated 9 months ago
- ☆17Jun 22, 2024Updated last year
- A cross platform App that gives you the best UX to run models locally or remotely on your own hardware☆81Mar 22, 2026Updated 2 months ago
- Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input a target size and the toolchain w…☆132Updated this week
- A universal adapter including zero-copy Python bindings for Philip Turner's metal flash attention library.☆26May 12, 2026Updated 2 weeks ago
- Opinionated agentic RAG powered by LanceDB, Pydantic AI, and Docling☆528Updated this week
- Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc☆4,240Updated this week
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- Bro scripts to monitor for new hosts within a subnet range that aren't whitelisted/vetted.☆13Jun 28, 2013Updated 12 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- An experimental node☆23Jan 13, 2026Updated 4 months ago
- Battery level of WH-1000XM4 headphones and other series models, based on the WMI wrapper for Plug-and-Play devices.☆15Dec 30, 2025Updated 4 months ago
- A minimal tool to generate and validate datasets.☆27Mar 8, 2026Updated 2 months ago
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆33Dec 29, 2025Updated 5 months ago
- Kubernetes operator for local LLM inference with llama.cpp, vLLM, TGI, and mlx-server — multi-GPU NVIDIA + Apple Silicon Metal, autoscali…☆109Updated this week
- Awesome LLM speech-to-speech models and frameworks☆55Nov 17, 2025Updated 6 months ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated 2 years ago