finnchen11 / VLLM_PromptCache
Optimize vLLM with persistent system prompt caching and block reuse for faster, memory-efficient inference.
☆53Updated 2 months ago
Alternatives and similar repositories for VLLM_PromptCache
Users who are interested in VLLM_PromptCache are comparing it to the libraries listed below.
- A pytorch implementation of the paper "TreeLoRA: Efficient Continual Learning via Layer-Wise LoRAs Guided by a Hierarchical Gradient-Simi…☆342Updated 2 months ago
- ☆42Updated 10 months ago
- ☆22Updated 2 months ago
- ☆43Updated 4 months ago
- ☆135Updated last year
- ☆324Updated 3 weeks ago
- Advanced Driving Assistance System based on Jetson Nano☆84Updated 5 months ago
- Ethereum World Cup prediction project☆14Updated 2 years ago
- AI phone agents for business.☆18Updated 10 months ago
- ☆42Updated 2 weeks ago
- A bibliometric visualization platform that integrates Gestalt design principles, keyword extraction algorithms, temporal algorithms, mach…☆89Updated last month
- DIVER: Reinforced Diffusion Breaks Imitation Bottlenecks in End-to-End Autonomous Driving☆103Updated this week
- ☆199Updated last month
- ☆79Updated last week
- Example project using universal links as deeplinks to switch iOS apps.☆13Updated last year
- ☆137Updated last year
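- A backend development template built on the NestJS framework and following the RESTful API design style. It integrates databases (MySQL, MongoDB), caching (Redis), and the RSA asymmetric algorithm, and implements basic authentication guards and CPU overload protection. It bundles most of the basic functionality so you can focus on core business development.☆56Updated last week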
- ☆143Updated last year
- Cascade is a production-ready, high-performance, and low-latency audio stream processing library designed for Voice Activity Detection (V…☆83Updated 3 weeks ago
- Libraries (with docs) for the RISCV64 architecture that are easy for users to install and deploy.☆69Updated 3 months ago
- A QR-code-based scan-to-order system for a seamless dining experience, deployed as Docker containers.☆50Updated 3 weeks ago
- Main Project of AIDE☆91Updated 10 months ago
- Manage TypeCho the Hexo way (uses GitHub Actions to automatically publish posts to TypeCho)☆83Updated 7 months ago
- CAPTCHA recognition☆12Updated 3 years ago
- ☆24Updated last week
- [PRL 2025, APSIPA 2022] Syllable Analysis Data Augmentation (SADA), This project introduces a glyph dictionary and grammar-aware augmenta…☆68Updated 3 months ago
- 3D Generative AI | Text/Image to 3DFI production-ready 3D Assets.☆33Updated 8 months ago
- Dataset and evaluation code of ISDrama(ACM-MM 2025): Immersive Spatial Drama Generation through Multimodal Prompting☆236Updated 3 months ago
- Repository for the paper:☆69Updated last year
- Gobi Web is a modern front-end interface for a business intelligence (BI) system, built with Vue 3 and Element Plus.☆66Updated 5 months ago