Optimize vLLM with persistent system prompt caching and block reuse for faster, memory-efficient inference.
☆53Oct 6, 2025Updated 4 months ago
Alternatives and similar repositories for VLLM_PromptCache
Users that are interested in VLLM_PromptCache are comparing it to the libraries listed below
Sorting:
- The implementation of RAG-LER☆17Sep 19, 2025Updated 5 months ago
- 利用Python实现的DBMS☆15May 16, 2023Updated 2 years ago
- no☆26Apr 23, 2025Updated 10 months ago
- ☆17Sep 20, 2021Updated 4 years ago
- Go bindings for the CUDA Driver and Runtime APIs, cuBLAS, and cuDNN.☆153Dec 24, 2025Updated 2 months ago
- A wearable, a necklace-like medical device using Alpha Wave sound therapy to treat depression in dogs☆16Mar 22, 2024Updated last year
- AI-powered tools designed to enhance, restore, and personalize your visual content.☆18Mar 7, 2024Updated last year
- Efficient controlnet for DiTs☆383May 10, 2025Updated 9 months ago
- MGCF-Net for Phishing URLs Detection☆50May 20, 2025Updated 9 months ago
- Source code for SIGGRAPH25 DreamMask: Boosting Open-vocabulary Panoptic Segmentation with Synthetic Data☆63Nov 19, 2025Updated 3 months ago
- 自动生成 markdown 标题序号☆27Sep 16, 2023Updated 2 years ago
- Metrics for Go — lightweight, concurrent-safe, and with built-in support for exporting Counters, Gauges, and Timers to DataDog via DogSta…☆41Jun 8, 2025Updated 8 months ago
- ☆23Jan 27, 2022Updated 4 years ago
- 弹幕系统☆28Dec 4, 2022Updated 3 years ago
- A fast JSON5 encoder/decoder for Python☆43Apr 16, 2025Updated 10 months ago
- the pedometer with excitation system☆30Oct 29, 2021Updated 4 years ago
- kubernetes pod bandwidth rate limiting, setting bandwidth quota & custom-limitrange☆28Feb 12, 2026Updated 2 weeks ago
- 基于go语言开发的长链接服务,基于goroutine对连接进行包装,支持ack消息回执,心跳检测,分布式部署,对外开放rpc、http两种调用模式,提供在线人数统计、对点消息发送、全盘消息发送等多种模式☆25Mar 28, 2023Updated 2 years ago
- 手搓云计算运维开发 第一阶段私有云Dashboard 第二阶段CICD☆35Dec 19, 2024Updated last year
- Print the supported API resources along with groups/versions on the server☆23Aug 14, 2021Updated 4 years ago
- ☆10Feb 16, 2026Updated last week
- RegTool supports a wide range of software registries and package managers to enhance your development workflow.☆38Jul 23, 2024Updated last year
- 💻 CLI News 是一个命令行新闻工具,从 RSS feed 获取新闻并完成翻译,在摸鱼的时候方便地浏览新闻内容☆46Jan 31, 2025Updated last year
- An easy-to-use vector database.☆37Apr 3, 2025Updated 10 months ago
- use sklearn to detect two types of network attacks☆34Jun 6, 2019Updated 6 years ago
- ☆48Apr 14, 2025Updated 10 months ago
- AdapSafe: Adaptive and Safe-Certified Deep Reinforcement Learning-Based Frequency Control for Carbon-neutral Power Systems☆26Feb 19, 2025Updated last year
- Augmenting the Interpretability of GraphCodeBERT for Code Similarity Tasks☆20Dec 3, 2025Updated 2 months ago
- Advanced Multi-Agent Optimization System featuring intelligent routing strategies, semantic memory optimization, distributed coordination…☆16Aug 15, 2025Updated 6 months ago
- React Render for Phoenix Framework☆53Oct 30, 2025Updated 4 months ago
- Assignment, homework and everything in Northeastern University Miami☆32Updated this week
- [TPAMI24] GALA: Graph Diffusion-based Alignment with Jigsaw for Source-free Domain Adaptation☆36Jan 12, 2025Updated last year
- Improving fast adversarial training with prior-guided knowledge (TPAMI2024)☆43Apr 21, 2024Updated last year
- A pure Python-implemented, lightweight, server-optional, multi-end compatible, vector database deployable locally or remotely.☆39Aug 11, 2025Updated 6 months ago
- A lightweight(qing) and fast(kuai) front end framework, it will make you development work easier(qingkuai)☆79Jun 5, 2025Updated 8 months ago
- A kotlin backend development tool library,mainly includes common kotlin extensions for daily projects。轻松将kotlin加入现有java后端项目,自己日常工具类☆27Jun 16, 2022Updated 3 years ago
- A secure and efficient API key management system that helps developers and teams easily manage API keys for various AI model【一个安全且高效的API密…☆18Mar 12, 2025Updated 11 months ago
- 一个集文档、代码实践于一体的技术知识库平台。包含文档、代码编辑、管理后台等5个应用的monorepo项目。采用Next.js、NestJS等现代技术栈,为开发者提供学习和实践平台。☆17Jul 21, 2025Updated 7 months ago
- ☆24Dec 2, 2025Updated 2 months ago