Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.
☆18Jan 10, 2025Updated last year
Alternatives and similar repositories for llama-gguf-optimize
Users that are interested in llama-gguf-optimize are comparing it to the libraries listed below
Sorting:
- Quick access to any large language model from your browser.☆10Feb 16, 2026Updated 2 weeks ago
- Get aid from local LLMs right in your PowerShell☆15May 2, 2025Updated 10 months ago
- AI Search engine☆13Sep 24, 2025Updated 5 months ago
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…☆50Feb 17, 2026Updated 2 weeks ago
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆36Jul 2, 2025Updated 8 months ago
- Simple node proxy for llama-server that enables MCP use☆17May 10, 2025Updated 9 months ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated 11 months ago
- SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI. Features semantic caching, model profi…☆59Feb 27, 2026Updated last week
- ☆17Dec 16, 2024Updated last year
- ☆25Apr 26, 2025Updated 10 months ago
- Analyze Reddit posts☆30Feb 27, 2025Updated last year
- An AI Vision Language Model System for extracting structured knowledge graph information(JSON) from images of process diagrams☆40Apr 5, 2025Updated 11 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆35Feb 11, 2026Updated 3 weeks ago
- llama-swap + a minimal ollama compatible api☆51Feb 13, 2026Updated 3 weeks ago
- A comprehensive WebUI Toolkit for Resemble-AI's Chatterbox☆23Jun 7, 2025Updated 9 months ago
- ☆25Jan 24, 2026Updated last month
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF…☆96Feb 15, 2026Updated 2 weeks ago
- OpenAI compatible API for Dia-1.6B☆36Apr 27, 2025Updated 10 months ago
- Fingerprint recognition with OpenCV☆11Jul 10, 2021Updated 4 years ago
- Text to audio with Tik-Tok Voices☆13Apr 6, 2023Updated 2 years ago
- ☆14Feb 28, 2026Updated last week
- This project showcases engaging interactions between two AI chatbots.☆10Jan 10, 2024Updated 2 years ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆15Nov 22, 2025Updated 3 months ago
- A dashboard for the real-time monitoring of publicly available online COVID-19 posts across Facebook, Twitter, and Instagram by health-fo…☆13Sep 19, 2023Updated 2 years ago
- Efficient computer use agent powered by Meta Llama 4 Maverick☆46Apr 17, 2025Updated 10 months ago
- 基于janus-gateway实现的流媒体服务器。☆11May 15, 2024Updated last year
- Curated list of awesome plugins for ChatGPT☆15Mar 23, 2023Updated 2 years ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆51Feb 10, 2026Updated 3 weeks ago
- Copilot with deepseek and more...☆13Mar 7, 2025Updated last year
- Enterprise-ready vector database toolkit for building searchable knowledge bases from multiple data sources. Supports multi-project manag…☆30Updated this week
- ☆10Jan 23, 2025Updated last year
- A Simple, Explainable Vision Language Model for detecting manifacturing defects into products☆14Sep 23, 2025Updated 5 months ago
- Generate random character (PCs or NPCs) backgrounds using the "Central Casting: Heroes of Legend" book☆11Sep 13, 2023Updated 2 years ago
- Home server set up☆13Oct 5, 2025Updated 5 months ago
- ☆12Jun 1, 2025Updated 9 months ago
- Simple and powerful extension for searching web and viewing website content.☆11Apr 11, 2025Updated 10 months ago
- Kind of bomberman made using Unreal Engine. Local Multiplayer☆12Apr 24, 2019Updated 6 years ago
- Automated pipeline for downloading, staging, ingesting, and investigating leaked and declassified archives (DDoSecrets, National Security…☆31Sep 22, 2025Updated 5 months ago
- ☆10Sep 29, 2024Updated last year