[ACL 2026] Towards Efficient Large Language Model Serving: A Survey on System-Aware KV Cache Optimization
☆291Apr 21, 2026Updated last month
Alternatives and similar repositories for Awesome-KV-Cache-Optimization
Users who are interested in Awesome-KV-Cache-Optimization are comparing it to the repositories listed below.
- [FPGA 2020] A systematic framework for optimizing OpenCL applications on FPGAs☆20Apr 9, 2023Updated 3 years ago
- Universal memory layer for AI Agents. It provides scalable, extensible, and interoperable memory storage and retrieval to streamline AI a…☆3,091May 22, 2026Updated last week
- Fast Parallel Probabilistic Graphical Model Learning and Inference [IPDPS'22, PPoPP'23, USENIX ATC'24]☆77Oct 26, 2025Updated 7 months ago
- [ACM MM 2025] SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation. https://arxiv.org/abs/2506.03139☆79Nov 10, 2025Updated 6 months ago
- TorchHook: A PyTorch hooks manager, providing convenient interfaces to capture feature maps and debug models.☆15Oct 1, 2025Updated 7 months ago
- ☆19Dec 1, 2024Updated last year
- Artifact repository for all languages, covering mainstream tools such as npm, Maven, PyPi, Docker, Gradle, SBT, Cocoapods, Swift, RPM, Debian, PHP, Go, Pub, Ivy, NuGet, Conda, Cargo, Conan, Yarn, GitLFS, Helm, and OHPM, covering…☆2,334Dec 24, 2025Updated 5 months ago
- 53AI Hub is an open-source AI portal, which enables you to quickly build an operational-level AI portal to launch and operate AI agents, p…☆4,951May 21, 2026Updated last week
- OTFS-channel-estimation☆68Jun 21, 2025Updated 11 months ago
- A fast gigapixel processing system☆1,253Dec 10, 2024Updated last year
- Custom validation starter for Spring Boot☆92Apr 15, 2026Updated last month
- Free SQLite for VSCode. Supports writing SQL statements☆524Sep 18, 2025Updated 8 months ago
- Let's use AI to Earn!☆15,981May 21, 2026Updated last week
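- 🔥🔥🔥 100% open-source enterprise-grade e-commerce mall system, source code available for download. Built on a cutting-edge tech stack, available in both Java and PHP versions, based on SpringBoot3+Vue3+Ts, supporting multiple clients including H5, WeChat mini programs, WeChat official accounts, iOS, Android, and HarmonyOS, with high performance and high concurrency, and very easy to customize. The standard edition supports PC/mobile page decoration, one-click theme color…☆3,122Mar 18, 2026Updated 2 months ago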
- Official code release for paper "Robo-Imagine: A Robotic Video Generation Model, For Autoregressive Long-Term Task Video Generation With …☆31Jul 13, 2025Updated 10 months ago
- User interview platform☆24Aug 1, 2025Updated 9 months ago
- DeepBot is a system-level AI assistant built for both personal productivity and enterprise workflows — one-click setup, seamless experien …☆2,237Updated this week
- ☆2,680Aug 25, 2025Updated 9 months ago
- https://x.com/robodotai☆226Mar 29, 2025Updated last year
- Official Repository of Cooragent.☆1,734Apr 29, 2026Updated last month
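- 🔥🔥🔥📌 Rule engine, open-source edition 📌 RuleEngine is configured visually on the web: simple, efficient, and fast. Business logic no longer depends on code development; complex business logic can be implemented with zero code!☆581Dec 20, 2025Updated 5 months ago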
- [CVPR2026] RAGTrack: Language-aware RGBT Tracking with Retrieval-Augmented Generation☆64Updated this week
- Official code repository for paper "VisionTS++: Cross-Modal Time Series Foundation Model with Continual Pre-trained Visual Backbones"☆35Nov 9, 2025Updated 6 months ago
- [2025CVPR] FlowRAM: Grounding Flow Matching Policy with Region-Aware Mamba Framework for Robotic Manipulation☆53Nov 11, 2025Updated 6 months ago
- HAGAMEAI is a training framework for AI game evolution.☆99Jul 7, 2025Updated 10 months ago
- amrnb codec from 3gpp official website http://www.3gpp.org/DynaReport/26204.htm☆10Apr 30, 2014Updated 12 years ago
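- 🔥🔥🔥 Visual drag-and-drop big data integration platform, big data platform, big data; includes data flows, data sources, data alignment, query templates, comprehensive monitoring, and more. Synchronize and clean data with no code, just like drawing a flowchart.☆616Apr 12, 2026Updated last month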
- XiaoyaoSearch (小遥搜索): Understands your words, reads your images, finds any local file with AI. Making se…☆1,054May 22, 2026Updated last week
- ☆14Feb 2, 2021Updated 5 years ago
- ☆486Aug 20, 2025Updated 9 months ago
- Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation☆165Jan 20, 2026Updated 4 months ago
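- Production-grade iOS network communication and architecture in practice: a high-performance low-level communication framework built on CocoaAsyncSocket that handles on the order of ten thousand messages per day and serves real enterprise customers! Distilled from years of IM development experience, it fully presents the evolution from a single-TCP architecture to an enterprise-grade multiplexed architecture.☆433Mar 3, 2026Updated 2 months ago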
- Official repository of the 3DV 2025 paper "LapisGS: Layered Progressive 3D Gaussian Splatting for Adaptive Streaming"☆50Sep 10, 2025Updated 8 months ago
- AIFlowy is an enterprise-grade AI application development platform based on Java, comparable to products like Dify and Coze.☆853May 22, 2026Updated last week
- TOMs is a fully open-source, high-performance, systematic, plugin-oriented, and scenario-agnostic general-purpose development framework.☆481Dec 4, 2025Updated 5 months ago
- China Unicom's Yuanjing Wanwu Agent Platform is an enterprise-grade, multi-tenant AI agent development platform. It helps users build app…☆2,514May 15, 2026Updated 2 weeks ago
- ☆52Sep 16, 2022Updated 3 years ago
- This report evaluates the feasibility of using spatial logic and computer vision for analysing athletes, spheres, and movements in sports…☆64Feb 21, 2026Updated 3 months ago
- GraphRag vs Embeddings☆16Jul 14, 2024Updated last year