robbiemu / llama-gguf-optimizeLinks
Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.
☆14Updated 4 months ago
Alternatives and similar repositories for llama-gguf-optimize
Users that are interested in llama-gguf-optimize are comparing it to the libraries listed below
Sorting:
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆22Updated 5 months ago
- AI Search engine☆12Updated 3 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 6 months ago
- Locally hosted AI Agent Python Tool To Generate Novel Research Hypothesis + Titles + Abstracts☆23Updated last month
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆30Updated this week
- ☆22Updated 9 months ago
- Large-Language-Model to Machine Interface project.☆19Updated last year
- Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…☆10Updated 7 months ago
- A proxy that hosts multiple single-model runners such as LLama.cpp and vLLM☆11Updated last week
- Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools.☆30Updated 3 months ago
- Attend - to what matters.☆15Updated 3 months ago
- Web Interface for Vision Language Models Including InternVLM2☆22Updated 10 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Updated 5 months ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆36Updated 3 months ago
- LLM Chat is an open-source serverless alternative to ChatGPT.☆34Updated 8 months ago
- A unified library for interacting with various AI APIs through a standardized interface.☆30Updated 2 months ago
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆32Updated 10 months ago
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆28Updated 6 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- Build HTML artefacts with Ollama☆11Updated 5 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated 2 months ago
- ☆17Updated 5 months ago
- Glyphs, acting as collaboratively defined symbols linking related concepts, add a layer of multidimensional semantic richness to user-AI …☆48Updated 3 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆22Updated 11 months ago
- ☆16Updated last week
- ☆21Updated 4 months ago
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice.☆20Updated 7 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Updated 9 months ago
- V.I.S.O.R., my in-development AI-powered voice assistant with integrated memory!☆36Updated last month
- An API for VoiceCraft.☆25Updated 11 months ago