robbiemu / llama-gguf-optimizeLinks
Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.
☆17Updated 11 months ago
Alternatives and similar repositories for llama-gguf-optimize
Users that are interested in llama-gguf-optimize are comparing it to the libraries listed below
Sorting:
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆33Updated 2 months ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆25Updated last year
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice.☆22Updated last year
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆23Updated last year
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆34Updated 5 months ago
- ☆17Updated last year
- ☆51Updated last year
- ☆24Updated 10 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 3 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆46Updated 7 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Updated last year
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Updated 9 months ago
- Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…☆11Updated last year
- Attend - to what matters.☆17Updated 9 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆25Updated 8 months ago
- A micro LLM multi-agent system for data analysis☆17Updated 7 months ago
- AI Search engine☆12Updated 2 months ago
- A unified library for interacting with various AI APIs through a standardized interface.☆32Updated 9 months ago
- ☆19Updated 5 months ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆22Updated last year
- Automated LLM novelist☆46Updated last year
- ☆15Updated 8 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31Updated 7 months ago
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Updated last year
- Make Qwen3 Think like Gemini 2.5 Pro | Open webui function☆25Updated 7 months ago
- Local LLM inference & management server with built-in OpenAI API☆31Updated last year
- Yet Another (LLM) Web UI, made with Gemini☆12Updated 11 months ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …☆39Updated last year