finnchen11 / VLLM_PromptCacheLinks
Optimize vLLM with persistent system prompt caching and block reuse for faster, memory-efficient inference.
☆53Updated this week
Alternatives and similar repositories for VLLM_PromptCache
Users that are interested in VLLM_PromptCache are comparing it to the libraries listed below
Sorting:
- A bibliometric visualization platform that integrates Gestalt design principles, keyword extraction algorithms, temporal algorithms, mach…☆89Updated 3 months ago
- Desktop Tiny Agent is a lightweight, modular desktop intelligent agent framework. It offers plugin extensibility, task scheduling (sync/a…☆80Updated last month
- ☆42Updated 8 months ago
- a rather fast time struct getter☆80Updated 2 months ago
- Main Project of AIDE☆91Updated 8 months ago
- 验证码识别☆12Updated 3 years ago
- ☆42Updated 6 months ago
- ☆78Updated last month
- Go bindings for the CUDA Driver and Runtime APIs, cuBLAS, and cuDNN.☆72Updated last week
- Source code of Fuyao, built on Nightcore☆17Updated last year
- Advanced Driving Assistance System based on Jetson Nano☆83Updated 3 months ago
- A lightweight and easy-to-use RPC framework created by Bruce Pang☆124Updated 7 months ago
- [PRL 2025, APSIPA 2022] Syllable Analysis Data Augmentation (SADA), This project introduces a glyph dictionary and grammar-aware augmenta…☆68Updated last month
- ☆138Updated last year
- Kafka Dog is a lightweight desktop application for visualizing and managing Apache Kafka. It provides a user-friendly graphical interface…☆125Updated 10 months ago
- ☆12Updated 2 years ago
- 以太坊世界杯竞猜项目☆14Updated last year
- Some of the libraries (docs) on the RISCV64 architecture are easy for users to install and deploy 一些riscv64 架构上面的库☆69Updated last month
- ☆12Updated 11 months ago
- The code for paper "Learning from Committee: Reasoning Distillation from a Mixture of Teachers with Peer-Review" accepted by ACL 2025.☆93Updated 4 months ago
- NuGet Go SDK☆31Updated last week
- An Integrated Library for Tuning, Deploying and Interpreting Genomic Models☆118Updated 3 weeks ago
- Cascade is a production-ready, high-performance, and low-latency audio stream processing library designed for Voice Activity Detection (V…☆81Updated 3 weeks ago
- Example project using universal links as deeplinks to switch iOS apps.☆13Updated last year
- excel转为go结构和json(go读取excel)☆40Updated 7 months ago
- ☆135Updated last year
- [TKDE2025] Next-Generation Database Interfaces: A Survey of LLM-based Text-to-SQL | A curated list of resources (surveys, papers, benchma…☆308Updated 2 weeks ago
- Gobi Web 是一个现代化的商业智能(BI)系统前端界面,基于 Vue 3 和 Element Plus 构建。☆64Updated 3 months ago
- High-performance Go BLAS/LAPACK with Intel MKL/OpenBLAS acceleration support☆45Updated last week
- ☆143Updated last year