robbiemu / llama-gguf-optimize
Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.
☆14Updated 3 months ago
Alternatives and similar repositories for llama-gguf-optimize:
Users that are interested in llama-gguf-optimize are comparing it to the libraries listed below
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆21Updated 3 months ago
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆21Updated 5 months ago
- Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…☆10Updated 5 months ago
- Yet Another (LLM) Web UI, made with Gemini☆11Updated 3 months ago
- ☆17Updated 4 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 4 months ago
- Attend - to what matters.☆14Updated last month
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Updated 7 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆30Updated this week
- Build HTML artefacts with Ollama☆11Updated 4 months ago
- LLM backed Fantasy Tribe Game☆18Updated 4 months ago
- AI Search engine☆12Updated last month
- ☆17Updated 2 months ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆20Updated 2 weeks ago
- Mycomind Daemon: A mycelium-inspired, advanced Mixture-of-Memory-RAG-Agents (MoMRA) cognitive assistant that combines multiple AI models …☆31Updated 9 months ago
- An OpenAI API compatible images server to generate or manipulate images.☆16Updated 2 months ago
- Modified Beam Search with periodical restart☆12Updated 7 months ago
- ☆22Updated 8 months ago
- ☆27Updated 7 months ago
- Large-Language-Model to Machine Interface project.☆18Updated last year
- Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!☆23Updated 2 weeks ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆33Updated 8 months ago
- a browser gui for nvidia smi☆18Updated last month
- Text generation in Python, as easy as possible☆56Updated this week
- An API for VoiceCraft.☆25Updated 9 months ago
- fork of litellm that is open source☆19Updated 4 months ago
- LLM Chat is an open-source serverless alternative to ChatGPT.☆33Updated 7 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated 11 months ago
- Controllable Language Model Interactions in TypeScript☆9Updated 11 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆21Updated 9 months ago