Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference
☆31 · Mar 19, 2026 · Updated this week
Alternatives and similar repositories for DefensiveKV
Users that are interested in DefensiveKV are comparing it to the libraries listed below.
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs". ☆16 · Sep 15, 2024 · Updated last year
- The Official Implementation of Ada-KV [NeurIPS 2025] ☆128 · Nov 26, 2025 · Updated 3 months ago
- Official Implementation of Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training ☆143 · Mar 13, 2026 · Updated last week
- ☆46 · Mar 15, 2025 · Updated last year
- ☆27 · Updated this week
- LLM-guided hyperparameter tuning ☆10 · Oct 7, 2023 · Updated 2 years ago
- ☆12 · Feb 15, 2023 · Updated 3 years ago
- The raw data and analysis code for the Microsoft Academic paper recommender system user study conducted in 2018. ☆17 · May 21, 2019 · Updated 6 years ago
- My record about learning the course MIT-6.824 ☆13 · Mar 28, 2022 · Updated 3 years ago
- Project for CS101016 and CS100160, Tongji University. Use Verilog HDL to build a CPU. ☆10 · Mar 20, 2021 · Updated 5 years ago
- The agentic plotting "IDE" built for everyone ☆39 · Updated this week
- TPLink IPC Control ☆19 · Jul 24, 2024 · Updated last year
- ☆14 · Aug 3, 2024 · Updated last year
- ☆11 · Mar 9, 2022 · Updated 4 years ago
- Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework" ☆11 · Oct 23, 2024 · Updated last year
- A game like QQTang (QQ堂) ☆15 · Dec 8, 2022 · Updated 3 years ago
- ThinK: Thinner Key Cache by Query-Driven Pruning ☆27 · Feb 11, 2025 · Updated last year
- Incorporating the memory mechanism into the transformer and employing a parallel weighting structure to obtain a better utterance-level r… ☆22 · Oct 4, 2025 · Updated 5 months ago
- ☆12 · Mar 24, 2025 · Updated last year
- ☆93 · Sep 10, 2025 · Updated 6 months ago
- ☆15 · Feb 20, 2024 · Updated 2 years ago
- [EMNLP 2025 Main] SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning ☆34 · Jan 11, 2026 · Updated 2 months ago
- ☆16 · Jun 25, 2024 · Updated last year
- ☆16 · Apr 13, 2025 · Updated 11 months ago
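- Code repository for the Fall 2023 Engineering C Programming course, including the lab code and lab reports for lab1-5. If you are interested, please give it a star. ☆12 · Mar 1, 2025 · Updated last year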
- ☆11 · Oct 10, 2021 · Updated 4 years ago
- Computational analysis of nucleic acids structures using graph neural networks ☆15 · Mar 25, 2024 · Updated last year
- The 54-instruction CPU required by Tongji University's Principles of Computer Organization lab ☆13 · Feb 27, 2020 · Updated 6 years ago
- A collection of papers on LLM applications in the IoT field. ☆17 · Jan 21, 2026 · Updated 2 months ago
- Experimental deep learning framework written in Rust ☆15 · Nov 2, 2022 · Updated 3 years ago
- [NeurIPS 2024] State Space Models on Temporal Graphs: A First-Principles Study ☆16 · Dec 31, 2024 · Updated last year
- ☆18 · Apr 21, 2024 · Updated last year
- South China Normal University Compiler Principles lab, 2022 cohort ☆13 · Jun 16, 2024 · Updated last year
- ☆22 · Jan 31, 2025 · Updated last year
- A PyG-based package of spectral GNNs with benchmark evaluations (SIGMOD 2026). ☆19 · Aug 20, 2025 · Updated 7 months ago
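- Beamer template for South China Normal University ☆15 · Nov 11, 2020 · Updated 5 years ago
- The code of MoG ☆20 · Aug 6, 2024 · Updated last year
- South China Normal University Compiler Principles project, 2021 cohort (second semester of 2023-2024) ☆16 · Sep 22, 2024 · Updated last year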
- The evaluation framework for training-free sparse attention in LLMs ☆122 · Jan 27, 2026 · Updated last month