finnchen11 / VLLM_PromptCacheLinks
Optimize vLLM with persistent system prompt caching and block reuse for faster, memory-efficient inference.
☆53Updated 4 months ago
Alternatives and similar repositories for VLLM_PromptCache
Users that are interested in VLLM_PromptCache are comparing it to the libraries listed below
Sorting:
- ☆32Updated 2 months ago
- ☆80Updated last month
- Advanced Driving Assistance System based on Jetson Nano☆86Updated 6 months ago
- ☆143Updated last year
- ☆135Updated last year
- ☆135Updated last year
- ☆42Updated last year
- Desktop Tiny Agent is a lightweight, modular desktop intelligent agent framework. It offers plugin extensibility, task scheduling (sync/a…☆80Updated 5 months ago
- Kubernetes Operator for managing OpenResty with custom CRDs (OpenResty, Server, Location, Upstream, RateLimitPolicy)☆50Updated 8 months ago
- ☆42Updated 2 months ago
- Dataset and evaluation code of ISDrama(ACM-MM 2025): Immersive Spatial Drama Generation through Multimodal Prompting☆236Updated 5 months ago
- A pytorch implementation of the paper "TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical Gradient-Simi…☆344Updated last month
- Main Project of AIDE☆91Updated last year
- A bibliometric visualization platform that integrates Gestalt design principles, keyword extraction algorithms, temporal algorithms, mach…☆89Updated 3 months ago
- 根据RESTful API软件设 计风格并基于NestJS框架开发的一款后端开发模板集成了数据库(Mysql,Mongodb)、缓存(Redis)、非对称算法RSA,实现了基本的身份验证守卫以及CPU过载保护。它帮你集成了大部分基础功能让你可以专注于主要的业务开发☆55Updated this week
- ☆13Updated 3 years ago
- Analysis and visualization of multi-omics data. In ongoing development: multi-modal fusion, sparse learning, and spatio-temporal effects.…☆206Updated 3 weeks ago
- Build a complete experiment pipeline for your PyTorch MIP model in 10 seconds.☆86Updated this week
- excel转为go结构和json(go读取excel)☆40Updated 11 months ago
- Some of the libraries (docs) on the RISCV64 architecture are easy for users to install and deploy 一些riscv64 架构上面的库☆69Updated 5 months ago
- [PRL 2025, APSIPA 2022] Syllable Analysis Data Augmentation (SADA), This project introduces a glyph dictionary and grammar-aware augmenta…☆68Updated 5 months ago
- a rather fast time struct getter☆80Updated 6 months ago
- An Interaction Fiction Demo Powered AI Dungeon☆84Updated 4 months ago
- Example project using universal links as deeplinks to switch iOS apps.☆13Updated last year
- ☆43Updated 6 months ago
- ☆22Updated 4 months ago
- 以太坊世界杯竞猜项目☆14Updated 2 years ago
- A lightweight and easy-to-use RPC framework created by Bruce Pang☆125Updated 11 months ago
- 用Hexo的方式管理TypeCho(使用Github Actions自动更新文章到TypeCho)☆83Updated 9 months ago
- DeepRug☆40Updated last year