Optimize vLLM with persistent system prompt caching and block reuse for faster, memory-efficient inference.
☆53Oct 6, 2025Updated 5 months ago
Alternatives and similar repositories for VLLM_PromptCache
Users that are interested in VLLM_PromptCache are comparing it to the libraries listed below
Sorting:
- no☆26Apr 23, 2025Updated 10 months ago
- Go bindings for the CUDA Driver and Runtime APIs, cuBLAS, and cuDNN.☆153Dec 24, 2025Updated 2 months ago
- The implementation of RAG-LER☆17Sep 19, 2025Updated 6 months ago
- 利用Python实现的DBMS☆15May 16, 2023Updated 2 years ago
- Source code for SIGGRAPH25 DreamMask: Boosting Open-vocabulary Panoptic Segmentation with Synthetic Data☆63Nov 19, 2025Updated 4 months ago
- ☆17Sep 20, 2021Updated 4 years ago
- Efficient controlnet for DiTs☆383May 10, 2025Updated 10 months ago
- ☆24Dec 2, 2025Updated 3 months ago
- AdapSafe: Adaptive and Safe-Certified Deep Reinforcement Learning-Based Frequency Control for Carbon-neutral Power Systems☆27Feb 19, 2025Updated last year
- kubernetes pod bandwidth rate limiting, setting bandwidth quota & custom-limitrange☆28Mar 9, 2026Updated last week
- A tool ot export, analyse and visualize your transactions, rewards and commissions of your liquidity mining pools or DEX transactions.☆12Feb 13, 2022Updated 4 years ago
- ☆18May 14, 2025Updated 10 months ago
- A wearable, a necklace-like medical device using Alpha Wave sound therapy to treat depression in dogs☆16Mar 22, 2024Updated last year
- AI-powered tools designed to enhance, restore, and personalize your visual content.☆18Mar 7, 2024Updated 2 years ago
- ☆10Feb 16, 2026Updated last month
- Official implementation of CIT(IJCV'24): Cascaded iterative transformer for jointly predicting facial landmark, occlusion probability and…☆38Dec 10, 2025Updated 3 months ago
- ☆30Jan 27, 2026Updated last month
- We will send our supply to the Education Foundation after the migrating.☆102May 16, 2025Updated 10 months ago
- use sklearn to detect two types of network attacks☆34Jun 6, 2019Updated 6 years ago
- Repository of "Modal-NexT: toward unified heterogeneous cellular data integration"☆86Jun 16, 2025Updated 9 months ago
- The repository for 'Tri$^{2}$-plane: Volumetric Avatar Reconstruction with Feature Pyramid'☆141May 4, 2025Updated 10 months ago
- Augmenting the Interpretability of GraphCodeBERT for Code Similarity Tasks☆22Dec 3, 2025Updated 3 months ago
- React Render for Phoenix Framework☆52Mar 6, 2026Updated 2 weeks ago
- A secure and efficient API key management system that helps developers and teams easily manage API keys for various AI model【一个安全且高效的API密…☆18Mar 12, 2025Updated last year
- Advanced Multi-Agent Optimization System featuring intelligent routing strategies, semantic memory optimization, distributed coordination…☆17Aug 15, 2025Updated 7 months ago
- 自动生成 markdown 标题序号☆27Sep 16, 2023Updated 2 years ago
- ☆23Jan 27, 2022Updated 4 years ago
- ☆68May 16, 2023Updated 2 years ago
- 弹幕系统☆28Dec 4, 2022Updated 3 years ago
- Metrics for Go — lightweight, concurrent-safe, and with built-in support for exporting Counters, Gauges, and Timers to DataDog via DogSta…☆41Jun 8, 2025Updated 9 months ago
- High-performance Go BLAS/LAPACK with Intel MKL/OpenBLAS acceleration support☆46Dec 6, 2025Updated 3 months ago
- 💻 CLI News 是一个命令行新闻工具,从 RSS feed 获取新闻并完成翻译,在摸鱼的时候方便地浏览新闻内容☆46Jan 31, 2025Updated last year
- Backend for HR Admin Console with Spring Boot☆12Jan 26, 2024Updated 2 years ago
- A fast JSON5 encoder/decoder for Python☆43Apr 16, 2025Updated 11 months ago
- ☆371Sep 6, 2025Updated 6 months ago
- HTU21D full-featured driver library for general-purpose MCU and Linux.☆47Oct 25, 2025Updated 4 months ago
- the pedometer with excitation system☆30Oct 29, 2021Updated 4 years ago
- EAViz(离线版,在线版参见EAViz-OL)是一款AI赋能的临床级癫痫分析工具,聚焦“算法易用性+临床实用性”,整合脑电/视频多模态数据与深度学习模型,构建“数据输入-模型推理-可视化输出”全链路闭环,打通科研算法与临床用户的使用壁垒,为癫痫诊断、治疗决策及科研工作提供…☆25Dec 4, 2025Updated 3 months ago
- ☆162Oct 9, 2025Updated 5 months ago