robbiemu / llama-gguf-optimize
Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.
☆14Updated 2 months ago
Alternatives and similar repositories for llama-gguf-optimize:
Users that are interested in llama-gguf-optimize are comparing it to the libraries listed below
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆29Updated this week
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆21Updated 3 months ago
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆21Updated 4 months ago
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!☆35Updated this week
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 4 months ago
- ☆17Updated 3 months ago
- ☆17Updated last month
- LLM backed Fantasy Tribe Game☆18Updated 4 months ago
- Build HTML artefacts with Ollama☆11Updated 3 months ago
- LLM Chat is an open-source serverless alternative to ChatGPT.☆33Updated 6 months ago
- ☆18Updated 6 months ago
- Yet Another (LLM) Web UI, made with Gemini☆11Updated 2 months ago
- Large-Language-Model to Machine Interface project.☆18Updated last year
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆31Updated 8 months ago
- Crow is a Desktop AI Assistant☆32Updated 7 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆32Updated 8 months ago
- ☆22Updated 7 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆21Updated 8 months ago
- ☆27Updated 6 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Updated 6 months ago
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools.☆20Updated 3 weeks ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated 11 months ago
- Modified Beam Search with periodical restart☆12Updated 6 months ago
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice.☆19Updated 4 months ago
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆12Updated 2 weeks ago