robbiemu / llama-gguf-optimize
Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.
☆18Jan 10, 2025Updated last year
Alternatives and similar repositories for llama-gguf-optimize
Users who are interested in llama-gguf-optimize are comparing it to the libraries listed below
- Get aid from local LLMs right in your PowerShell☆15May 2, 2025Updated 9 months ago
- Quick access to any large language model from your browser.☆10Sep 20, 2024Updated last year
- AI Search engine☆13Sep 24, 2025Updated 4 months ago
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆36Jul 2, 2025Updated 7 months ago
- 🚀 FlexLLama - Lightweight self-hosted tool for running multiple llama.cpp server instances with OpenAI v1 API compatibility and multi-GP…☆50Nov 26, 2025Updated 2 months ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Mar 12, 2025Updated 11 months ago
- Simple node proxy for llama-server that enables MCP use☆17May 10, 2025Updated 9 months ago
- ☆17Dec 16, 2024Updated last year
- ☆25Apr 26, 2025Updated 9 months ago
- An AI Vision Language Model system for extracting structured knowledge graph information (JSON) from images of process diagrams☆41Apr 5, 2025Updated 10 months ago
- Analyze Reddit posts☆29Feb 27, 2025Updated 11 months ago
- llama-swap + a minimal ollama compatible api☆46Jan 23, 2026Updated 3 weeks ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆35Oct 21, 2025Updated 3 months ago
- A comprehensive WebUI Toolkit for Resemble-AI's Chatterbox☆22Jun 7, 2025Updated 8 months ago
- OpenAI compatible API for Dia-1.6B☆36Apr 27, 2025Updated 9 months ago
- This project showcases engaging interactions between two AI chatbots.☆10Jan 10, 2024Updated 2 years ago
- Fingerprint recognition with OpenCV☆11Jul 10, 2021Updated 4 years ago
- Kinematic and dynamic models of continuum and articulated soft robots.☆15Nov 22, 2025Updated 2 months ago
- A dashboard for the real-time monitoring of publicly available online COVID-19 posts across Facebook, Twitter, and Instagram by health-fo…☆13Sep 19, 2023Updated 2 years ago
- Text to audio with Tik-Tok Voices☆13Apr 6, 2023Updated 2 years ago
- A powerful MCP testing tool with multi-provider LLM support (Ollama, OpenAI, Claude, Gemini). Test, debug, and develop MCP servers with a…☆18Jan 7, 2026Updated last month
- ☆12Jun 1, 2025Updated 8 months ago
- Efficient computer use agent powered by Meta Llama 4 Maverick☆46Apr 17, 2025Updated 9 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …☆50Updated this week
- Technical docs to help you make your Halo Strix WORK!☆23Jan 10, 2026Updated last month
- Automated pipeline for downloading, staging, ingesting, and investigating leaked and declassified archives (DDoSecrets, National Security…☆31Sep 22, 2025Updated 4 months ago
- A streaming media server built on janus-gateway.☆11May 15, 2024Updated last year
- Home server set up☆13Oct 5, 2025Updated 4 months ago
- ☆10Jan 23, 2025Updated last year
- Generate random character (PCs or NPCs) backgrounds using the "Central Casting: Heroes of Legend" book☆11Sep 13, 2023Updated 2 years ago
- ☆10Sep 29, 2024Updated last year
- Standalone desktop application for Text-to-Speech (TTS) utilizing the Kokoro-82M AI model for pdf files☆28Updated this week
- Simple and powerful extension for searching web and viewing website content.☆11Apr 11, 2025Updated 10 months ago
- Enterprise-ready vector database toolkit for building searchable knowledge bases from multiple data sources. Supports multi-project manag…☆27Updated this week
- Curated list of awesome plugins for ChatGPT☆15Mar 23, 2023Updated 2 years ago
- A Bomberman-style game made with Unreal Engine. Local multiplayer.☆12Apr 24, 2019Updated 6 years ago
- Copilot with deepseek and more...☆13Mar 7, 2025Updated 11 months ago
- A simple, explainable Vision Language Model for detecting manufacturing defects in products☆14Sep 23, 2025Updated 4 months ago
- Fast and memory-efficient exact attention - Windows wheels☆36Apr 30, 2025Updated 9 months ago