robbiemu / llama-gguf-optimize
Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.
☆17Updated 9 months ago
Alternatives and similar repositories for llama-gguf-optimize
Users who are interested in llama-gguf-optimize are comparing it to the repositories listed below.
- Browser extension that lets you summarize and chat with any webpage using a local LLM of your choice.☆22Updated last year
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆25Updated 10 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆32Updated 2 weeks ago
- ☆51Updated last year
- ☆17Updated 10 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆25Updated 7 months ago
- AI Search engine☆12Updated last month
- Tcurtsni: Reverse Instruction Chat, ever wonder what your LLM wants to ask you?☆23Updated last year
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆34Updated 4 months ago
- An API for VoiceCraft.☆25Updated last year
- Bookmarklet to pull and run Hugging Face GGUF models in Ollama☆17Updated last year
- Yet another frontend for LLMs, written using .NET and WinUI 3☆10Updated last month
- Make Qwen3 Think like Gemini 2.5 Pro | Open WebUI function☆23Updated 5 months ago
- Automated LLM novelist☆44Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31Updated 6 months ago
- Generate a llama-quantize command to copy the quantization parameters of any GGUF☆24Updated 3 months ago
- MilimoChat: Privacy-first, self-hosted AI chat with customizable personas, context-aware memory, and local analytics. Built on Python/Str…☆14Updated 7 months ago
- A unified library for interacting with various AI APIs through a standardized interface.☆31Updated 7 months ago
- Gradio-based tool to run open-source LLM models directly from Hugging Face☆96Updated last year
- The heart of The Pulsar App: fast, secure and shared inference with a modern UI☆58Updated 11 months ago
- ☆23Updated last year
- ☆24Updated 9 months ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆39Updated 8 months ago
- A micro LLM multi-agent system for data analysis☆17Updated 6 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Updated last year
- ACE-Step: A Step Towards Music Generation Foundation Model☆45Updated 5 months ago
- LlamaCards is a web application that provides a dynamic interface for interacting with LLM models in real-time. This app allows users to …☆39Updated last year
- "a towel is about the most massively useful thing an interstellar AI hitchhiker can have"☆48Updated last year
- Game Companion AI is an advanced application designed to enhance the gaming experience by providing real-time analysis and interpretation…☆53Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆71Updated last year