Smart OpenAI‑compatible proxy for llama.cpp: manages slots, saves/restores KV cache to disk, routes requests by prefix similarity, and protects hot slots from being overwritten. Accelerates long prompts (30–60k tokens) via instant reuse or fast on‑demand restore; supports SSE streaming and non‑stream JSON over /v1/chat/completions.
☆38Nov 14, 2025Updated 5 months ago
Alternatives and similar repositories for proxycache
Users that are interested in proxycache are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- Proxy for OpenAI☆16Sep 2, 2025Updated 8 months ago
- world's stupidest moe llm in 103M parameters☆20Jul 18, 2025Updated 9 months ago
- Skills for creating high quality skills and agents☆134Apr 5, 2026Updated last month
- triton for AMD gfx906 GPUs, e.g. Radeon VII / MI50 / MI60☆46Dec 8, 2025Updated 4 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Email to HTTP proxy for Papra document ingestion☆15Jan 4, 2026Updated 4 months ago
- Google Cast protocol v2 implementation for Sming allowing you to control your smart TV or cast device from a microcontroller.☆11Feb 13, 2026Updated 2 months ago
- ☆17Jun 16, 2024Updated last year
- Helper scripts to be run on Frappe sites☆16Apr 16, 2026Updated 2 weeks ago
- MCP tools for Rust Context Engineering (rustdocs, rust analyzer)☆17Feb 8, 2026Updated 2 months ago
- Push Notification Relay Server for Frappe Apps☆12Dec 24, 2025Updated 4 months ago
- A Modern GDB Frontend☆38Dec 7, 2025Updated 4 months ago
- Dynamic Swagger UI for frappe Apps☆19Sep 30, 2024Updated last year
- TaskFlowAI is a lightweight and flexible framework designed for creating AI-driven task pipelines and multi-agent workflows. It provides …☆18Nov 18, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Mojolicious lite and bootstrap based simple cms☆10Nov 25, 2022Updated 3 years ago
- CLI secret management☆16Mar 19, 2026Updated last month
- Awesome AI Benchmarks☆31Jan 16, 2026Updated 3 months ago
- Transform YouTube videos into a compounding knowledge base with transcripts, vision analysis, and agentic search. Works as an MCP server …☆93Apr 13, 2026Updated 3 weeks ago
- ☆15Apr 3, 2025Updated last year
- Tool To Manage Linux Kernel Modules☆27May 25, 2023Updated 2 years ago
- IceCash. Касса Linux. Рабочее место кассира под linux с использованием web интерфейса. С драйвером к Штрих-М ФРК.☆25Jun 9, 2015Updated 10 years ago
- A tiny PID 1 for containers, written in x86-64 NASM and ARM64 GAS.☆19Feb 23, 2026Updated 2 months ago
- this is to send custom emails from google spreadsheet using google app scripts☆16Nov 30, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- a frappe app for my medium post☆16Jun 19, 2024Updated last year
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.☆19Jan 10, 2025Updated last year
- ☆23Mar 26, 2026Updated last month
- Temporal is a library of useful Date and Time functions (plus a Redis database) that can be integrated with other Frappe framework applic…☆16Dec 5, 2025Updated 5 months ago
- “There is no such thing as a moral or an immoral book. Books are well written, or badly written.” I want to find all the well written con…☆20Nov 6, 2024Updated last year
- You can run passively cooled single slot Tesla GPU/KI cards in a HP Proliant with a modified iLO ROM that take care of the fan and conseq…☆19Dec 16, 2025Updated 4 months ago
- a .net port of GhostCursor https://github.com/Xetera/ghost-cursor☆12May 30, 2021Updated 4 years ago
- A simple Mojolicious application example for authenticating a user and maintaining a session☆11Sep 20, 2019Updated 6 years ago
- Прокси-сервер для подключения Алисы к Dialogflow☆13Mar 23, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Gives realtime and asynchronous feedback to parts created in the FreeCAD Part Design and Sketcher workbenches.☆18Dec 29, 2024Updated last year
- Binary application to clean up .cargo/registry & .cargo/git cache☆27Apr 3, 2026Updated last month
- dig many at once☆22Jun 18, 2024Updated last year
- ☆16Dec 16, 2024Updated last year
- AdaLLM is an NVFP4-first inference runtime for Ada Lovelace (RTX 4090) with FP8 KV cache and custom decode kernels. This repo targets NVF…☆117Feb 15, 2026Updated 2 months ago
- Blinka makes her debut on the big screen! With this library you can use CircuitPython displayio code on PC and Raspberry Pi to output to …☆13Feb 21, 2026Updated 2 months ago
- Working with LLM in C#☆16Apr 1, 2026Updated last month