Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference
☆43Mar 28, 2026Updated last month
Alternatives and similar repositories for DefensiveKV
Users that are interested in DefensiveKV are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆17Sep 15, 2024Updated last year
- The Official Implementation of Ada-KV [NeurIPS 2025]☆132Nov 26, 2025Updated 5 months ago
- PyTorch implementation of Language model compression with weighted low-rank factorization☆13Jun 28, 2023Updated 2 years ago
- Official Implementation of Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training☆181Mar 13, 2026Updated last month
- ☆47Mar 15, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official implementation for LaCo (EMNLP 2024 Findings)☆21Oct 3, 2024Updated last year
- LLM-guided hyperparameter tuning☆10Oct 7, 2023Updated 2 years ago
- ☆11Feb 15, 2023Updated 3 years ago
- The raw data and analysis code for the Microsoft Academic paper recommender system user study conducted in 2018.☆17May 21, 2019Updated 6 years ago
- My record about learning the course MIT-6.824☆13Mar 28, 2022Updated 4 years ago
- Project for CS101016 and CS100160, Tongji University. Use Verilog HDL to build a CPU.☆10Mar 20, 2021Updated 5 years ago
- ☆45Mar 24, 2026Updated last month
- TPLink IPC Control☆20Jul 24, 2024Updated last year
- ☆140Aug 18, 2025Updated 8 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆14Aug 3, 2024Updated last year
- ☆11Mar 9, 2022Updated 4 years ago
- Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework"☆11Oct 23, 2024Updated last year
- Unofficial implementations of block/layer-wise pruning methods for LLMs.☆78Apr 29, 2024Updated 2 years ago
- a game like qqtang qq堂 游戏☆15Dec 8, 2022Updated 3 years ago
- ThinK: Thinner Key Cache by Query-Driven Pruning☆29Feb 11, 2025Updated last year
- Incorporating the memory mechanism into the transformer and employing a parallel weighting structure to obtain a better utterance-level r…☆22Oct 4, 2025Updated 7 months ago
- Hands-on workshop: Build a multi-agent AI system from scratch — Deep Research Agent + Writing Workflow served as MCP servers. Includes co…☆202Updated this week
- ☆12Mar 24, 2025Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆15Feb 20, 2024Updated 2 years ago
- ☆101Sep 10, 2025Updated 7 months ago
- ☆16Jun 25, 2024Updated last year
- ☆17Apr 13, 2025Updated last year
- 23秋季工程化C程序设计代码仓库,包括lab1-5的实验代码和实验报告,感兴趣的话就点个star吧~☆11Mar 1, 2025Updated last year
- The agentic plotting "IDE" built for everyone☆56Apr 2, 2026Updated last month
- ☆11Oct 10, 2021Updated 4 years ago
- [EMNLP 2025 Main] SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning☆41Apr 16, 2026Updated 2 weeks ago
- Computational analysis of nucleic acids structures using graph neural networks☆15Mar 25, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- 同济的计算机组成原理实验要求的54条指令CPU☆13Feb 27, 2020Updated 6 years ago
- A collection of papers on LLM applications in the IoT field.☆18Jan 21, 2026Updated 3 months ago
- Experimental deep learning framework written in Rust☆15Nov 2, 2022Updated 3 years ago
- [NeurIPS 2024] State Space Models on Temporal Graphs: A First-Principles Study☆16Dec 31, 2024Updated last year
- ☆18Apr 21, 2024Updated 2 years ago
- 2022级华南师范大学编译原理实验☆15Jun 16, 2024Updated last year
- ☆23Jan 31, 2025Updated last year