iuliaturc / gguf-docsView external linksLinks
Docs for GGUF quantization (unofficial)
☆367Jul 19, 2025Updated 6 months ago
Alternatives and similar repositories for gguf-docs
Users that are interested in gguf-docs are comparing it to the libraries listed below
Sorting:
- Writing Tools, Apple's AI-inspired app, enchants Windows, enhancing your pen with AI LLMs. One hotkey press, system-wide, fixes grammar, …☆26Jul 26, 2025Updated 6 months ago
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆23Sep 1, 2025Updated 5 months ago
- Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input your VRAM and RAM and the toolcha…☆76Updated this week
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆30Apr 30, 2025Updated 9 months ago
- Mirror from gitlab☆11Jan 9, 2021Updated 5 years ago
- A minimal CLI tool for piping anything into an LLM.☆18Jan 1, 2026Updated last month
- Minimal web client for chatting and roleplay with AI characters☆26Aug 21, 2025Updated 5 months ago
- A lightweight API that returns Nvidia GPU utilisation information.☆15Sep 23, 2025Updated 4 months ago
- A tool for humans to interact with a Chroma vector database☆16Mar 2, 2025Updated 11 months ago
- 🌳 MCTS-inspired parallel beam search for conversation optimization. Explore multiple dialogue strategies simultaneously, stress-test a…☆35Jan 18, 2026Updated 3 weeks ago
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- AI Based "Happiness Optimizer"☆12Oct 20, 2024Updated last year
- Neural Audio Codecs implemented in C# - DAC, SNAC, Encodec, Dia☆45Jun 11, 2025Updated 8 months ago
- A small repo to release coursier using self-hosted Mac M1 runner☆10Updated this week
- llama.cpp fork with additional SOTA quants and improved performance☆1,605Feb 8, 2026Updated last week
- Distillations and expansions on Rocktastic12a☆16Feb 5, 2026Updated last week
- Video plugin for Mupen64Plus v2.0, based on the Arachnoid plugin for Project64.☆19May 1, 2025Updated 9 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Dec 1, 2024Updated last year
- Web UI for working with large language models☆38Jun 13, 2024Updated last year
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13May 5, 2024Updated last year
- Awesome LLM speech-to-speech models and frameworks☆39Nov 17, 2025Updated 2 months ago
- AI Assistant☆20Apr 18, 2025Updated 9 months ago
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- JotItNow is a AI Voice Notes App☆24Mar 6, 2025Updated 11 months ago
- A llama.cpp simple wrapper in Swift☆18Nov 9, 2025Updated 3 months ago
- Stable Diffusion in pure C/C++☆16Jan 11, 2026Updated last month
- An extension for oobabooga/text-generation-webui that automatically unloads and reloads your model.☆17Apr 22, 2024Updated last year
- Enhancing LLMs with LoRA☆207Oct 20, 2025Updated 3 months ago
- Transplants vocabulary between language models, enabling the creation of draft models for speculative decoding WITHOUT retraining.☆49Oct 29, 2025Updated 3 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Jan 5, 2026Updated last month
- Turn any Kiwix ZIM archive (offline Wikipedia, Stack Exchange, DevDocs, etc.) into an instant knowledge source for LLMs with a tiny CLI +…☆73Jun 4, 2025Updated 8 months ago
- ☆23Dec 9, 2025Updated 2 months ago
- an open source ai stylist☆77Jul 2, 2025Updated 7 months ago
- An educational Rust project for exporting and running inference on Qwen3 LLM family☆40Aug 3, 2025Updated 6 months ago
- An OpenVoice-based voice cloning tool, single executable file (~14M), supporting multiple formats without dependencies on ffmpeg, Python,…☆44Jan 18, 2026Updated 3 weeks ago
- ☆39Sep 24, 2025Updated 4 months ago
- Metal GPU implementation of the Qwen3 transformer model on macOS with complete Apple Silicon compute shader acceleration.☆41Oct 6, 2025Updated 4 months ago
- Reliable model swapping for any local OpenAI/Anthropic compatible server - llama.cpp, vllm, etc☆2,374Feb 8, 2026Updated last week
- A professional-grade interface for Qwen3-TTS, designed to unlock the model's full potential with fine-grained control and intuitive workf…☆177Updated this week