Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference
☆40 Mar 28, 2026 Updated 2 weeks ago
Alternatives and similar repositories for DefensiveKV
Users that are interested in DefensiveKV are comparing it to the libraries listed below.
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs". ☆16 Sep 15, 2024 Updated last year
- The Official Implementation of Ada-KV [NeurIPS 2025] ☆131 Nov 26, 2025 Updated 4 months ago
- Official Implementation of Spatial-TTT: Streaming Visual-based Spatial Intelligence with Test-Time Training ☆169 Mar 13, 2026 Updated last month
- PyTorch implementation of Language model compression with weighted low-rank factorization ☆13 Jun 28, 2023 Updated 2 years ago
- ☆47 Mar 15, 2025 Updated last year
- Official implementation for LaCo (EMNLP 2024 Findings) ☆21 Oct 3, 2024 Updated last year
- LLM-guided hyperparameter tuning ☆10 Oct 7, 2023 Updated 2 years ago
- ☆11 Feb 15, 2023 Updated 3 years ago
- The raw data and analysis code for the Microsoft Academic paper recommender system user study conducted in 2018. ☆17 May 21, 2019 Updated 6 years ago
- My record about learning the course MIT-6.824 ☆13 Mar 28, 2022 Updated 4 years ago
- Project for CS101016 and CS100160, Tongji University. Use Verilog HDL to build a CPU. ☆10 Mar 20, 2021 Updated 5 years ago
- ☆42 Mar 24, 2026 Updated 3 weeks ago
- TPLink IPC Control ☆20 Jul 24, 2024 Updated last year
- ☆140 Aug 18, 2025 Updated 7 months ago
- ☆14 Aug 3, 2024 Updated last year
- ☆11 Mar 9, 2022 Updated 4 years ago
- Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework" ☆11 Oct 23, 2024 Updated last year
- Unofficial implementations of block/layer-wise pruning methods for LLMs. ☆78 Apr 29, 2024 Updated last year
- A game like QQTang (QQ堂) ☆15 Dec 8, 2022 Updated 3 years ago
- ThinK: Thinner Key Cache by Query-Driven Pruning ☆29 Feb 11, 2025 Updated last year
- Incorporating the memory mechanism into the transformer and employing a parallel weighting structure to obtain a better utterance-level r… ☆22 Oct 4, 2025 Updated 6 months ago
- ☆12 Mar 24, 2025 Updated last year
- ☆15 Feb 20, 2024 Updated 2 years ago
- ☆97 Sep 10, 2025 Updated 7 months ago
- ☆16 Jun 25, 2024 Updated last year
- ☆17 Apr 13, 2025 Updated last year
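- Code repository for the Fall 2023 Engineering C Programming course, including the lab code and lab reports for lab1-5. If you find it interesting, give it a star! ☆12 Mar 1, 2025 Updated last year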
- The agentic plotting "IDE" built for everyone ☆52 Apr 2, 2026 Updated last week
- ☆11 Oct 10, 2021 Updated 4 years ago
- [EMNLP 2025 Main] SpecVLM: Enhancing Speculative Decoding of Video LLMs via Verifier-Guided Token Pruning ☆40 Updated this week
- Computational analysis of nucleic acids structures using graph neural networks ☆15 Mar 25, 2024 Updated 2 years ago
- The 54-instruction CPU required by the Computer Organization lab course at Tongji University ☆13 Feb 27, 2020 Updated 6 years ago
- A collection of papers on LLM applications in the IoT field. ☆18 Jan 21, 2026 Updated 2 months ago
- Experimental deep learning framework written in Rust ☆15 Nov 2, 2022 Updated 3 years ago
- [NeurIPS 2024] State Space Models on Temporal Graphs: A First-Principles Study ☆16 Dec 31, 2024 Updated last year
- ☆18 Apr 21, 2024 Updated last year
- Compiler Principles lab assignments for the 2022 cohort at South China Normal University ☆14 Jun 16, 2024 Updated last year
- ☆22 Jan 31, 2025 Updated last year
- A PyG-based package of spectral GNNs with benchmark evaluations (SIGMOD 2026). ☆19 Aug 20, 2025 Updated 7 months ago