Optimize vLLM with persistent system prompt caching and block reuse for faster, memory-efficient inference.
☆53Oct 6, 2025Updated 8 months ago
Alternatives and similar repositories for VLLM_PromptCache
Users who are interested in VLLM_PromptCache are comparing it to the libraries listed below.
- no☆26Apr 23, 2025Updated last year
- Go bindings for the CUDA Driver and Runtime APIs, cuBLAS, and cuDNN.☆154Jun 22, 2026Updated last week
- The implementation of RAG-LER☆17Sep 19, 2025Updated 9 months ago
- A DBMS implemented in Python☆15May 16, 2023Updated 3 years ago
- Source code for SIGGRAPH25 DreamMask: Boosting Open-vocabulary Panoptic Segmentation with Synthetic Data☆62Nov 19, 2025Updated 7 months ago
- ☆17Sep 20, 2021Updated 4 years ago
- Efficient controlnet for DiTs☆387May 10, 2025Updated last year
- ☆26Dec 2, 2025Updated 6 months ago
- GraphiContact is a robust method for 3D human reconstruction and contact point prediction from monocular RGB images, utilizing pose-aware…☆52Mar 24, 2026Updated 3 months ago
- [AAAI2023] AdapSafe: Adaptive and Safe-Certified Deep Reinforcement Learning-Based Frequency Control for Carbon-neutral Power Systems☆30Feb 19, 2025Updated last year
- kubernetes pod bandwidth rate limiting, setting bandwidth quota & custom-limitrange☆28Updated this week
- ☆18May 14, 2025Updated last year
- A tool to export, analyse and visualize the transactions, rewards and commissions of your liquidity mining pools or DEX transactions.☆12Feb 13, 2022Updated 4 years ago
- A project completed by repairing the ClaudeCode-CLI source code☆120May 25, 2026Updated last month
- A wearable, a necklace-like medical device using Alpha Wave sound therapy to treat depression in dogs☆15Mar 22, 2024Updated 2 years ago
- AI-powered tools designed to enhance, restore, and personalize your visual content.☆18Mar 7, 2024Updated 2 years ago
- ☆10Feb 16, 2026Updated 4 months ago
- Official implementation of CIT(IJCV'24): Cascaded iterative transformer for jointly predicting facial landmark, occlusion probability and…☆39Dec 10, 2025Updated 6 months ago
- Black-box, open-source red-team testing for AI agents. Point Argus at any HTTP, gRPC, or browser-using agent endpoint, run 500+ adversari…☆190May 28, 2026Updated last month
- ☆33May 15, 2026Updated last month
- AdaRubric: Adaptive Dynamic Rubric Evaluator for Agent Trajectories☆313Jun 7, 2026Updated 3 weeks ago
- We will send our supply to the Education Foundation after the migrating.☆101May 16, 2025Updated last year
- [ECCV 2024] The repository for 'Tri$^{2}$-plane: Volumetric Avatar Reconstruction with Feature Pyramid'☆141May 4, 2025Updated last year
- Augmenting the Interpretability of GraphCodeBERT for Code Similarity Tasks☆25May 23, 2026Updated last month
- use sklearn to detect two types of network attacks☆34Jun 6, 2019Updated 7 years ago
- Repository of "Modal-NexT: toward unified heterogeneous cellular data integration"☆84Jun 16, 2025Updated last year
- React Render for Phoenix Framework☆52Mar 6, 2026Updated 3 months ago
- A secure and efficient API key management system that helps developers and teams easily manage API keys for various AI model…☆22Mar 12, 2025Updated last year
- Advanced Multi-Agent Optimization System featuring intelligent routing strategies, semantic memory optimization, distributed coordination…☆15Aug 15, 2025Updated 10 months ago
- Automatically generates heading numbers for Markdown☆27Sep 16, 2023Updated 2 years ago
- ☆22Jan 27, 2022Updated 4 years ago
- ☆68May 16, 2023Updated 3 years ago
- Metrics for Go — lightweight, concurrent-safe, and with built-in support for exporting Counters, Gauges, and Timers to DataDog via DogSta…☆41Jun 8, 2025Updated last year
- Danmaku (bullet comment) system☆28Dec 4, 2022Updated 3 years ago
- High-performance Go BLAS/LAPACK with Intel MKL/OpenBLAS acceleration support☆46Jun 22, 2026Updated last week
- Backend for HR Admin Console with Spring Boot☆12Jan 26, 2024Updated 2 years ago
- A fast JSON5 encoder/decoder for Python☆43Apr 16, 2025Updated last year
- ☆370Apr 1, 2026Updated 3 months ago
- the pedometer with excitation system☆30Oct 29, 2021Updated 4 years ago