finnchen11 / VLLM_PromptCache
Optimize vLLM with persistent system prompt caching and block reuse for faster, memory-efficient inference.
☆53Updated 3 weeks ago
Alternatives and similar repositories for VLLM_PromptCache
Users who are interested in VLLM_PromptCache are comparing it to the libraries listed below.
- A bibliometric visualization platform that integrates Gestalt design principles, keyword extraction algorithms, temporal algorithms, mach…☆89Updated 4 months ago
- A transparent, minimal, and hackable agent framework. ~300 lines of readable code. Full control, no magic.☆108Updated this week
- Main Project of AIDE☆91Updated 8 months ago
- ☆42Updated 9 months ago
- ☆42Updated 6 months ago
- The code for paper "Learning from Committee: Reasoning Distillation from a Mixture of Teachers with Peer-Review" accepted by ACL 2025.☆100Updated 5 months ago
- ☆41Updated last year
- WaveFormer: A Lightweight Transformer Model for sEMG-based Gesture Recognition☆85Updated last week
- Desktop Tiny Agent is a lightweight, modular desktop intelligent agent framework. It offers plugin extensibility, task scheduling (sync/a…☆80Updated 2 months ago
- This is a useful development tool that supports mocking for both GraphQL and RESTful APIs.☆22Updated last year
- Kubernetes Operator for managing OpenResty with custom CRDs (OpenResty, Server, Location, Upstream, RateLimitPolicy)☆49Updated 5 months ago
- Automatic Texture Mapping Software for Oblique Photogrammetry Models☆47Updated 8 months ago
- Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.☆38Updated 7 months ago
- Advanced Driving Assistance System based on Jetson Nano☆83Updated 3 months ago
- ☆78Updated 3 weeks ago
- Gobi Web is a modern business intelligence (BI) system front end, built with Vue 3 and Element Plus.☆64Updated 4 months ago
- ☆138Updated last year
- High-performance Go BLAS/LAPACK with Intel MKL/OpenBLAS acceleration support☆45Updated last month
- ☆11Updated 2 years ago
- Ethereum World Cup prediction game project☆14Updated last year
- ☆143Updated last year
- ☆121Updated 3 months ago
- Zenless Zone Zero (ZenlessZoneZero) one-click automation tool | Hollow Zero | Daily tasks | Reward check-in | Automatic stamina spending☆55Updated last week
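- A back-end development template built on the NestJS framework, following the RESTful API design style. It integrates databases (MySQL, MongoDB), caching (Redis), and the RSA asymmetric algorithm, and implements basic authentication guards and CPU overload protection. It bundles most of the basic functionality so you can focus on your main business logic.☆56Updated 3 months ago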
- ☆43Updated 2 months ago
- Repository for the paper:☆69Updated last year
- Example project using universal links as deeplinks to switch iOS apps.☆13Updated last year
- ☆135Updated last year
- A modified version of alphalens with updated dependencies and fixes☆129Updated 3 weeks ago
- An Interactive Fiction Demo Powered by AI Dungeon☆85Updated last month