k-koehler/gguf-tensor-overrider

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/k-koehler/gguf-tensor-overrider)

k-koehler / gguf-tensor-overrider

☆58

Alternatives and similar repositories for gguf-tensor-overrider

Users that are interested in gguf-tensor-overrider are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

thad0ctor / llama-server-launcher
View on GitHub
Llama Server Launcher (llama.cpp/ik_llama) GUI
☆123Updated this week
crashr / brute-llama
View on GitHub
Testbench for llama.cpp llama-server
☆15Aug 20, 2025Updated 11 months ago
kooshi / llama-swappo
View on GitHub
llama-swap + a minimal ollama compatible api
☆60May 26, 2026Updated 2 months ago
and270 / thinking_effort_processor
View on GitHub
☆93Jul 7, 2025Updated last year
JoeCastrom / mcp-chat-studio
View on GitHub
A powerful MCP testing tool with multi-provider LLM support (Ollama, OpenAI, Claude, Gemini). Test, debug, and develop MCP servers with a…
☆18Apr 28, 2026Updated 3 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
davidbrowne17 / Mimi-Voice
View on GitHub
Create Unmute voice embeddings
☆26Nov 15, 2025Updated 8 months ago
hasaranga / NativeChat
View on GitHub
win32 native frontend for llama-cli
☆14Nov 2, 2024Updated last year
autollama / autollama
View on GitHub
Anthropic's Contextual Retrieval implementation with visual chunk comparison. Preview context enrichment before/after embedding.
☆30Sep 25, 2025Updated 10 months ago
the-crypt-keeper / reasonscape
View on GitHub
Information Processing Evaluation for Large Language Models
☆66Updated this week
thad0ctor / KrunchWrapper
View on GitHub
☆18Jul 1, 2025Updated last year
lucasavila00 / LmScript
View on GitHub
Controllable Language Model Interactions in TypeScript
☆10May 17, 2024Updated 2 years ago
CalvinSturm / LocalAgent
View on GitHub
Local-first agent runtime for MCP workflows with explicit trust controls, replayable runs, and built-in evals.
☆32Jul 4, 2026Updated 3 weeks ago
fishiatee / yawullm
View on GitHub
Yet Another (LLM) Web UI, made with Gemini
☆12Dec 25, 2024Updated last year
extopico / llama-server_mcp_proxy
View on GitHub
Simple node proxy for llama-server that enables MCP use
☆19May 10, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
pwilkin / llama-runner
View on GitHub
Llama.cpp runner/swapper and proxy that emulates LMStudio / Ollama backends
☆60Aug 21, 2025Updated 11 months ago
panilya / awesome-ai-benchmarks
View on GitHub
Awesome AI Benchmarks
☆40Updated this week
syv-ai / PybberLink
View on GitHub
☆13Mar 10, 2025Updated last year
0nspaceshipearth / Hermit-AI
View on GitHub
Local AI chat application with automatic offline context injection through ZIM files.
☆47Apr 3, 2026Updated 3 months ago
openserv-labs / mcp-proxy
View on GitHub
Lightweight reverse proxy + admin UI that turns your backend endpoints into multi-tenant Model Context Protocol tools for OpenServ—or any…
☆16May 28, 2025Updated last year
gowrav-vishwakarma / ai-video-generator-editor
View on GitHub
☆23Sep 20, 2025Updated 10 months ago
rpodcast / pod-db-dash
View on GitHub
Podcast index database quality dashboard
☆15Updated this week
sevenreasons / promptcat
View on GitHub
A zero-dependency prompt manager/catalog/library in a single HTML file. Everything is stored locally in your browser. Meow. 😼
☆81Aug 14, 2025Updated 11 months ago
hjc4869 / llama.cpp
View on GitHub
LLM inference in C/C++
☆16Jul 18, 2026Updated last week
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
chu-tianxiang / exl2-for-all
View on GitHub
EXL2 quantization generalized to other models.
☆11Mar 17, 2024Updated 2 years ago
airnsk / proxycache
View on GitHub
Smart OpenAI‑compatible proxy for llama.cpp: manages slots, saves/restores KV cache to disk, routes requests by prefix similarity, and pr…
☆49Nov 14, 2025Updated 8 months ago
phildougherty / qwen2.5_omni_chat
View on GitHub
Service for testing out the new Qwen2.5 omni model
☆62Apr 30, 2025Updated last year
santinic / unvibe
View on GitHub
Generate correct code from unit-tests
☆85Mar 22, 2025Updated last year
cofiprofim / AutoSeller
View on GitHub
A roblox tool to sell UGC Limiteds
☆10Aug 9, 2025Updated 11 months ago
jabberjabberjabber / Chunkify
View on GitHub
Create text chunks which end at natural stopping points without using a tokenizer
☆26Nov 26, 2025Updated 8 months ago
suncloudsmoon / quizzer
View on GitHub
Generate Duolingo-style quiz courses from PDFs with spaced repetition, adaptive difficulty, and tutor chat.
☆16Apr 6, 2026Updated 3 months ago
trailofbits / CompChomper
View on GitHub
CompChomper is a framework for measuring how LLMs perform at code completion.
☆21Apr 29, 2025Updated last year
willmil11 / cleanai-c
View on GitHub
Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)
☆15Jul 17, 2026Updated last week
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
2dameneko / ide-cap-chan
View on GitHub
ide-cap-chan is a utility for batch image captioning with natural language using various VL models
☆14May 8, 2026Updated 2 months ago
boneylizard / Eloquent
View on GitHub
The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…
☆64Updated this week
Recklesz / FileAggregator-for-LLMs
View on GitHub
🔍 Dead-simple local file selector that preps your docs for LLM prompts, no cloud needed. Drop in your files, get perfectly formatted con…
☆11Jan 11, 2025Updated last year
Mungert69 / GGUFModelBuilder
View on GitHub
A collection python tools used to create gguf files and upload to huggingface
☆17Jun 6, 2026Updated last month
llama-farm / LlamaPajamas
View on GitHub
☆52Nov 17, 2025Updated 8 months ago
tg-prplx / vellium
View on GitHub
Local-first desktop AI workbench for roleplay, multi-character chat, long-form writing, RAG, MCP tools, plugins, and local models.
☆105Updated this week
wchisasa / rabbit
View on GitHub
An fully autonomous agent that accesses the browser and performs tasks.
☆18Apr 25, 2025Updated last year