☆57Oct 10, 2025Updated 8 months ago
Alternatives and similar repositories for gguf-tensor-overrider
Users that are interested in gguf-tensor-overrider are comparing it to the libraries listed below.
- Orchestrator Kit for Agentic Reasoning - OrKa is a modular AI orchestration system that transforms Large Language Models (LLMs) into comp…☆96Jun 9, 2026Updated last week
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- A collection of Python tools used to create GGUF files and upload them to Hugging Face☆17Jun 6, 2026Updated last week
- Benchmarking tool for vLLM inference performance with GPU monitoring☆51Jun 7, 2026Updated last week
- Service for testing out the new Qwen2.5 omni model☆62Apr 30, 2025Updated last year
- Produce your own Dynamic 3.0 Quants and achieve optimum accuracy & SOTA quantization performance! Input a target size and the toolchain w…☆139Updated this week
- ☆94Jul 7, 2025Updated 11 months ago
- 🔍 Dead-simple local file selector that preps your docs for LLM prompts, no cloud needed. Drop in your files, get perfectly formatted con…☆11Jan 11, 2025Updated last year
- ide-cap-chan is a utility for batch image captioning with natural language using various VL models☆14May 8, 2026Updated last month
- CompChomper is a framework for measuring how LLMs perform at code completion.☆21Apr 29, 2025Updated last year
- adds a few extra samplers and schedulers to the dropdowns in recent A1111-derived webUIs for Stable Diffusion☆25Dec 5, 2025Updated 6 months ago
- MeshBBS is a lightweight, text-based bulletin board system designed to run over Meshtastic radios, enabling simple games, utilities, and …☆64Oct 17, 2025Updated 8 months ago
- A fully autonomous agent that accesses the browser and performs tasks.☆18Apr 25, 2025Updated last year
- Implements harmful/harmless refusal removal using pure HF Transformers☆21May 8, 2025Updated last year
- Cohere Toolkit is a collection of prebuilt components enabling users to quickly build and deploy RAG applications.☆30Jan 19, 2025Updated last year
- Thank you LenAnderson I am yoinking this!☆28May 3, 2026Updated last month
- ☆24Jan 22, 2025Updated last year
- A forward proxy to turn network traffic into personal memory for AI agents☆38Mar 30, 2026Updated 2 months ago
- ☆51Feb 19, 2025Updated last year
- Cleanai (https://github.com/willmil11/cleanai) except I'm making it in c now. Fast and clean from the start this time :)☆15May 29, 2026Updated 2 weeks ago
- JotItNow is an AI Voice Notes App☆26Mar 6, 2025Updated last year
- Yii2 gallery module☆13Dec 28, 2016Updated 9 years ago
- A simple, "Ollama-like" tool for managing and running GGUF language models from your terminal.☆25Jan 2, 2026Updated 5 months ago
- Learn faster with the power of AI☆17Updated this week
- A dynamic multi-expert AI architecture running on a single consumer GPU (RTX 3060).☆36Dec 2, 2025Updated 6 months ago
- Kiwix ZIM-to-vector RAG system for local, offline LLM knowledge retrieval☆24Mar 24, 2026Updated 2 months ago
- A Windows tool to query various LLM AIs. Supports branched conversations, history and summaries among others.☆36May 11, 2026Updated last month
- POwershell PredictOr POwered by coPilOt.☆13Oct 14, 2025Updated 8 months ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆24Apr 1, 2025Updated last year
- Hill Space is All You Need☆17Jul 11, 2025Updated 11 months ago
- Generate a llama-quantize command to copy the quantization parameters of any GGUF☆35Apr 20, 2026Updated last month
- ☆10Jan 23, 2025Updated last year
- ☆25Oct 13, 2025Updated 8 months ago
- Context Query language for Agents☆63Apr 13, 2026Updated 2 months ago
- KoboldCpp Smart Launcher with GPU Layer and Tensor Override Tuning☆30May 18, 2025Updated last year
- Your Interface to Intelligence☆49Apr 23, 2026Updated last month
- Lightweight C inference for Qwen3 GGUF. Multiturn prefix caching & batch processing.☆25Sep 1, 2025Updated 9 months ago
- A minimal interface for AI Companion that runs entirely in your browser.☆191Updated this week
- Local first human friendly agents toolkit for the browser and Nodejs☆44Updated this week