robbiemu / llama-gguf-optimizeLinks
Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.
☆18Updated last year
Alternatives and similar repositories for llama-gguf-optimize
Users that are interested in llama-gguf-optimize are comparing it to the libraries listed below
Sorting:
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆35Updated 3 months ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆25Updated last year
- AI Search engine☆12Updated 4 months ago
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice.☆22Updated last year
- ACE-Step: A Step Towards Music Generation Foundation Model☆47Updated 8 months ago
- Get aid from local LLMs right in your PowerShell☆15Updated 8 months ago
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆23Updated last year
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Updated 10 months ago
- Attend - to what matters.☆17Updated 11 months ago
- ☆24Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Updated 10 months ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆22Updated 10 months ago
- ☆51Updated last year
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆39Updated this week
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Updated 4 months ago
- ☆17Updated last year
- Chatbot-to-speech using Orpheus TTS model. Interactive console app.☆21Updated 9 months ago
- ☆15Updated 11 months ago
- A unified library for interacting with various AI APIs through a standardized interface.☆31Updated 10 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Updated last year
- Generate a llama-quantize command to copy the quantization parameters of any GGUF☆28Updated last week
- Make Qwen3 Think like Gemini 2.5 Pro | Open webui function☆25Updated 8 months ago
- ☆19Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31Updated 9 months ago
- run ollama & gguf easily with a single command☆52Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆41Updated last year
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆35Updated 6 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Updated last year
- Crow is a Desktop AI Assistant☆32Updated last year
- A real-time shared memory layer for multi-agent LLM systems.☆53Updated 2 weeks ago