[ACL 2026] Towards Efficient Large Language Model Serving: A Survey on System-Aware KV Cache Optimization
☆260Apr 7, 2026Updated last week
Alternatives and similar repositories for Awesome-KV-Cache-Optimization
Users that are interested in Awesome-KV-Cache-Optimization are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [CVPR 2024] Efficient Hyperparameter Optimization with Adaptive Fidelity Identification☆11Jul 12, 2024Updated last year
- Universal memory layer for AI Agents. It provides scalable, extensible, and interoperable memory storage and retrieval to streamline AI a…☆4,104Apr 10, 2026Updated last week
- (eBook,PDFs Translation) A multilingual eBook processing tool supporting all eBook formats. Features online and offline translation while…☆1,488Sep 28, 2025Updated 6 months ago
- TorchHook: A PyTorch hooks manager, providing convenient interfaces to capture feature maps and debug models.☆15Oct 1, 2025Updated 6 months ago
- ☆22Dec 1, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆58Jul 1, 2025Updated 9 months ago
- OTFS-channel-estimation☆62Jun 21, 2025Updated 9 months ago
- 53AI Hub is an open-source AI portal, which enables you to quickly build a operational-level AI portal to launch and operate AI agents, p…☆6,762Mar 4, 2026Updated last month
- A fast gigapixel processing system☆1,393Dec 10, 2024Updated last year
- DeepBot 是一个系统级 AI 助手,满足个人桌面助手使用,同时会更多探索企业生产提效方向,一键安装、丝 滑体验,飞书友好。☆1,242Apr 10, 2026Updated last week
- Free SQLite for VSCode.Support writing SQL statements☆769Sep 18, 2025Updated 7 months ago
- 🔥🔥🔥100%开源的企业级商城系统源码下载,使用最新前沿技术栈,同时支持java、php版本,基于SpringBoot3+Vue3+Ts,支持H5、微信小程序、公众号、IOS、安卓、鸿蒙等多端,高性能高并发,极易二次开发。标准版支持PC/移动端页面装修、主题颜色一键切…☆4,887Mar 18, 2026Updated last month
- Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With …☆29Jul 13, 2025Updated 9 months ago
- 用户面试平台☆24Aug 1, 2025Updated 8 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆54Jan 20, 2025Updated last year
- ☆52Nov 9, 2025Updated 5 months ago
- 【CVPR 2025 Highlight】MonSter: Marry Monodepth to Stereo Unleashes Power☆709Dec 2, 2025Updated 4 months ago
- ☆2,734Aug 25, 2025Updated 7 months ago
- Official Repository of Cooragent. Free Try on https://www.cooragent.com/☆1,876Mar 25, 2026Updated 3 weeks ago
- Dynamic Rollout Allocation and Advantage Modulation for Policy Optimization (DynaMO) - Official Implementation☆86Apr 11, 2026Updated last week
- A cpp version of NIC driver☆14Jan 19, 2026Updated 3 months ago
- HAGAMEAI is a training framework for AI game evolution.☆122Jul 7, 2025Updated 9 months ago
- A powerful serialization framework for Python objects with automatic type registration and validation. Extract from AgentSmith, released …☆14Mar 2, 2026Updated last month
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- 实现实时视频流分析、事件检测与智能感知。☆38Aug 18, 2025Updated 8 months ago
- the #official repo™ documenting and discussing the schema of pxon☆32Mar 22, 2018Updated 8 years ago
- 生产级iOS网络通信、架构实战 基于 CocoaAsyncSocket 打造的高性能底层通信框架,日均处理万级别消息,真实服务于企业客户!来源于多年IM开发经验总结,完整呈现从单TCP架构到企业级多路复用架构的演进之路。☆514Mar 3, 2026Updated last month
- A library for users to write (experiment in research) configurations in Python Dict or JSON format, read and write parameter value via do…☆1,589Aug 22, 2024Updated last year
- AIFlowy is an enterprise-grade AI application development platform based on Java, comparable to products like Dify and Coze.☆1,240Apr 10, 2026Updated last week
- TOMs is a fully open-source, high-performance, systematic, plugin-oriented, and scenario-agnostic general-purpose development framework.☆695Dec 4, 2025Updated 4 months ago
- China Unicom's Yuanjing Wanwu Agent Platform is an enterprise-grade, multi-tenant AI agent development platform. It helps users build app…☆3,371Apr 10, 2026Updated last week
- ☆64Sep 16, 2022Updated 3 years ago
- px2rem support to Brackets, convert px to rem or rem to px.☆13Mar 18, 2023Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- 🔥 An agile development and testing platform designed to empower small and medium-sized enterprises to build their own R&D systems, strea…☆2,358Mar 25, 2026Updated 3 weeks ago
- ☆10Feb 10, 2018Updated 8 years ago
- 🧠 dPro + Polymarket operational toolkit for market/account reads, YES/NO trading, order book access, readiness checks, and secure creden…☆42Mar 13, 2026Updated last month
- A TLS parser that can dig TLS information☆30Nov 20, 2018Updated 7 years ago
- [ICLR 26] TempFlow-GRPO (Temporal Flow GRPO), a principled GRPO framework that captures and exploits the temporal structure inherent in f…☆595Nov 24, 2025Updated 4 months ago
- 🚀🚀 Efficient implementations of Native Sparse Attention☆751Sep 29, 2025Updated 6 months ago
- CausalVLR: A Toolbox and Benchmark for Vision-Language Causal Reasoning (多模态因果推理开源框架)☆1,446Oct 11, 2025Updated 6 months ago