finnchen11/VLLM_PromptCache

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/finnchen11/VLLM_PromptCache)

finnchen11 / VLLM_PromptCache

Optimize vLLM with persistent system prompt caching and block reuse for faster, memory-efficient inference.

☆53

Alternatives and similar repositories for VLLM_PromptCache

Users that are interested in VLLM_PromptCache are comparing it to the libraries listed below

Sorting:

notoookay / rag-ler
View on GitHub
The implementation of RAG-LER
☆17Sep 19, 2025Updated 5 months ago
SaladDay / miniDBMS
View on GitHub
利用Python实现的DBMS
☆15May 16, 2023Updated 2 years ago
KleinMoretti-dev / OpenKimi-main
View on GitHub
no
☆26Apr 23, 2025Updated 10 months ago
FarhanWANG / Final-Year-Project
View on GitHub
☆17Sep 20, 2021Updated 4 years ago
gocnn / gocu
View on GitHub
Go bindings for the CUDA Driver and Runtime APIs, cuBLAS, and cuDNN.
☆153Dec 24, 2025Updated 2 months ago
yifan1207 / Dog-Depression-Sound-Therapy-Device
View on GitHub
A wearable, a necklace-like medical device using Alpha Wave sound therapy to treat depression in dogs
☆16Mar 22, 2024Updated last year
sam5-hub / DeImaginify
View on GitHub
AI-powered tools designed to enhance, restore, and personalize your visual content.
☆18Mar 7, 2024Updated last year
360CVGroup / RelaCtrl
View on GitHub
Efficient controlnet for DiTs
☆383May 10, 2025Updated 9 months ago
1Hun0ter1 / MGCF-Net
View on GitHub
MGCF-Net for Phishing URLs Detection
☆50May 20, 2025Updated 9 months ago
yuanpengtu / DreamMask
View on GitHub
Source code for SIGGRAPH25 DreamMask: Boosting Open-vocabulary Panoptic Segmentation with Synthetic Data
☆63Nov 19, 2025Updated 3 months ago
jarvanstack / marknum
View on GitHub
自动生成 markdown 标题序号
☆27Sep 16, 2023Updated 2 years ago
s3ndd / gometricus
View on GitHub
Metrics for Go — lightweight, concurrent-safe, and with built-in support for exporting Counters, Gauges, and Timers to DataDog via DogSta…
☆41Jun 8, 2025Updated 8 months ago
QianYiYiMomo / SiriusLite
View on GitHub
☆23Jan 27, 2022Updated 4 years ago
waterlang / bullet-chat
View on GitHub
弹幕系统
☆28Dec 4, 2022Updated 3 years ago
austinyu / ujson5
View on GitHub
A fast JSON5 encoder/decoder for Python
☆43Apr 16, 2025Updated 10 months ago
hkvincent / vpeodometer
View on GitHub
the pedometer with excitation system
☆30Oct 29, 2021Updated 4 years ago
kubeservice-stack / custom-limit-range
View on GitHub
kubernetes pod bandwidth rate limiting, setting bandwidth quota & custom-limitrange
☆28Feb 12, 2026Updated 2 weeks ago
mongofs / sim
View on GitHub
基于go语言开发的长链接服务，基于goroutine对连接进行包装，支持ack消息回执，心跳检测，分布式部署，对外开放rpc、http两种调用模式，提供在线人数统计、对点消息发送、全盘消息发送等多种模式
☆25Mar 28, 2023Updated 2 years ago
xiegangqingnian1021 / devops
View on GitHub
手搓云计算运维开发第一阶段私有云Dashboard 第二阶段CICD
☆35Dec 19, 2024Updated last year
chengshiwen / kubectl-resource-versions
View on GitHub
Print the supported API resources along with groups/versions on the server
☆23Aug 14, 2021Updated 4 years ago
sudo-yf / Test2504
View on GitHub
☆10Feb 16, 2026Updated last week
Sma1lboy / RegTool
View on GitHub
RegTool supports a wide range of software registries and package managers to enhance your development workflow.
☆38Jul 23, 2024Updated last year
EdGrass / CLINews
View on GitHub
💻 CLI News 是一个命令行新闻工具，从 RSS feed 获取新闻并完成翻译，在摸鱼的时候方便地浏览新闻内容
☆46Jan 31, 2025Updated last year
vortezwohl / Bhakti
View on GitHub
An easy-to-use vector database.
☆37Apr 3, 2025Updated 10 months ago
AaronZ345 / kdd99-attack-detection
View on GitHub
use sklearn to detect two types of network attacks
☆34Jun 6, 2019Updated 6 years ago
ZhenhaoPeng / WhiteLagoon
View on GitHub
☆48Apr 14, 2025Updated 10 months ago
waunx / AdapSafe
View on GitHub
AdapSafe: Adaptive and Safe-Certified Deep Reinforcement Learning-Based Frequency Control for Carbon-neutral Power Systems
☆26Feb 19, 2025Updated last year
jorge-martinez-gil / graphcodebert-interpretability
View on GitHub
Augmenting the Interpretability of GraphCodeBERT for Code Similarity Tasks
☆20Dec 3, 2025Updated 2 months ago
Linxiushen / MultiAgent-Optimization
View on GitHub
Advanced Multi-Agent Optimization System featuring intelligent routing strategies, semantic memory optimization, distributed coordination…
☆16Aug 15, 2025Updated 6 months ago
gsmlg-dev / phoenix-react
View on GitHub
React Render for Phoenix Framework
☆53Oct 30, 2025Updated 4 months ago
ErdunE / NortheasternMiami
View on GitHub
Assignment, homework and everything in Northeastern University Miami
☆32Updated this week
luo-junyu / GALA
View on GitHub
[TPAMI24] GALA: Graph Diffusion-based Alignment with Jigsaw for Source-free Domain Adaptation
☆36Jan 12, 2025Updated last year
jiaxiaojunQAQ / FGSM-PGK
View on GitHub
Improving fast adversarial training with prior-guided knowledge (TPAMI2024)
☆43Apr 21, 2024Updated last year
BirchKwok / lynsedb
View on GitHub
A pure Python-implemented, lightweight, server-optional, multi-end compatible, vector database deployable locally or remotely.
☆39Aug 11, 2025Updated 6 months ago
qingkuai-js / qingkuai
View on GitHub
A lightweight(qing) and fast(kuai) front end framework, it will make you development work easier(qingkuai)
☆79Jun 5, 2025Updated 8 months ago
jchanghong / kotlin-backend-tool-library
View on GitHub
A kotlin backend development tool library，mainly includes common kotlin extensions for daily projects。轻松将kotlin加入现有java后端项目，自己日常工具类
☆27Jun 16, 2022Updated 3 years ago
randomAndre / api-key-manager
View on GitHub
A secure and efficient API key management system that helps developers and teams easily manage API keys for various AI model【一个安全且高效的API密…
☆18Mar 12, 2025Updated 11 months ago
hs-knowledge-base / hs-knowledge-base
View on GitHub
一个集文档、代码实践于一体的技术知识库平台。包含文档、代码编辑、管理后台等5个应用的monorepo项目。采用Next.js、NestJS等现代技术栈，为开发者提供学习和实践平台。
☆17Jul 21, 2025Updated 7 months ago
simongu20070911 / quantitative-pricing-agents
View on GitHub
☆24Dec 2, 2025Updated 2 months ago