Smart OpenAI‑compatible proxy for llama.cpp: manages slots, saves/restores KV cache to disk, routes requests by prefix similarity, and protects hot slots from being overwritten. Accelerates long prompts (30–60k tokens) via instant reuse or fast on‑demand restore; supports SSE streaming and non‑stream JSON over /v1/chat/completions.
☆46Nov 14, 2025Updated 7 months ago
Alternatives and similar repositories for proxycache
Users that are interested in proxycache are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- Development enviroment in docker.☆17Jun 21, 2016Updated 9 years ago
- Proxy for OpenAI☆16Sep 2, 2025Updated 9 months ago
- Fork of "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆17Nov 27, 2024Updated last year
- NPX/Docker package that creates Ollama API server and forward requests to Gemni/OpenAI/Deepseek/Kimi K2. Mainly purpose to use Free tier …☆35Oct 5, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- world's stupidest moe llm in 103M parameters☆20Jul 18, 2025Updated 10 months ago
- Interactive terminals for AI agents, built for what you can't --yes away. SSH+MFA, GRUB/U-Boot, debconf installers, SOL/serial consoles, …☆81Mar 18, 2026Updated 2 months ago
- Google Cast protocol v2 implementation for Sming allowing you to control your smart TV or cast device from a microcontroller.☆11Feb 13, 2026Updated 4 months ago
- TaskFlowAI is a lightweight and flexible framework designed for creating AI-driven task pipelines and multi-agent workflows. It provides …☆17Nov 18, 2024Updated last year
- Mojolicious lite and bootstrap based simple cms☆10Nov 25, 2022Updated 3 years ago
- A Modern GDB Frontend☆42Dec 7, 2025Updated 6 months ago
- Shared video playback and controls for a group of people watching the same video. Uses WebRTC☆12Mar 15, 2019Updated 7 years ago
- IceCash. Касса Linux. Рабочее место кассира под linux с использованием web интерфейса. С драйвером к Штрих-М ФРК.☆25Jun 9, 2015Updated 11 years ago
- a network tunneling proxy☆36Jun 2, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- SBOM generator for Debian-based distributions☆31May 29, 2026Updated 2 weeks ago
- ☆23Mar 26, 2026Updated 2 months ago
- "Bubble Universe" display hack☆15Oct 17, 2023Updated 2 years ago
- Arduino Synthesizer Sampler☆16Mar 21, 2026Updated 2 months ago
- “There is no such thing as a moral or an immoral book. Books are well written, or badly written.” I want to find all the well written con…☆20Nov 6, 2024Updated last year
- Jailer is an eBPF-based process jailing system that provides mandatory access control (MAC) for Linux. It tracks processes using BPF task…☆53Mar 16, 2026Updated 2 months ago
- ☆11Nov 10, 2024Updated last year
- ☆16Dec 16, 2024Updated last year
- Connect Channel Messenger, Zalo, Viber, Skype, Telegram☆13Nov 5, 2018Updated 7 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ia-search | internet archive file browser☆36Apr 20, 2026Updated last month
- Downloads books from the amazon web reader☆31Oct 15, 2025Updated 8 months ago
- 💜 The slightly more compromising Python code formatter☆11Feb 26, 2021Updated 5 years ago
- Build custom ReST api's on top of Frappe☆22Nov 25, 2021Updated 4 years ago
- Docker image for Mojolicious☆16May 11, 2026Updated last month
- First-class state management via state machines☆66Jul 16, 2016Updated 9 years ago
- IPs(subnets) used by Yandex Home(Alice)☆17Jan 30, 2025Updated last year
- Cloud service management and operation repository for reference archtiecture☆16Jul 28, 2019Updated 6 years ago
- LLM based file organizer☆31Mar 24, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- For a number of years now, work has been proceeding in order to bring to perfection the crudely-conceived idea of a machine that would no…☆14Nov 12, 2025Updated 7 months ago
- Fast LLM swapping with sleep/wake support, compatible with vllm, llama.cpp, etc. llama-swap fork.☆46Apr 5, 2026Updated 2 months ago
- AI agent rules: markdown files for Claude.md, ChatGPT, Copilot, Cursor, Windsurf, and more.☆25Updated this week
- APRS symbol index used by aprs.fi☆16Mar 18, 2021Updated 5 years ago
- Little python functions that make life easier☆13Jul 21, 2021Updated 4 years ago
- Design low-traffic neighbourhoods in your web browser☆21Mar 16, 2026Updated 2 months ago
- ☆25Sep 6, 2024Updated last year