Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference
☆22Feb 9, 2026Updated 3 weeks ago
Alternatives and similar repositories for DefensiveKV
Users that are interested in DefensiveKV are comparing it to the libraries listed below
Sorting:
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆16Sep 15, 2024Updated last year
- The Official Implementation of Ada-KV [NeurIPS 2025]☆128Nov 26, 2025Updated 3 months ago
- ☆43Mar 15, 2025Updated 11 months ago
- ☆90Sep 10, 2025Updated 5 months ago
- ☆12Jul 4, 2024Updated last year
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆25Jul 21, 2025Updated 7 months ago
- The official implementation of the paper "Self-Updatable Large Language Models by Integrating Context into Model Parameters"☆15May 18, 2025Updated 9 months ago
- LLM-guided hyperparameter tuning☆10Oct 7, 2023Updated 2 years ago
- A collection of papers on LLM applications in the IoT field.☆18Jan 21, 2026Updated last month
- ☆13Jul 8, 2020Updated 5 years ago
- A repo to keep all resources about interpretability in NLP organised and up to date☆12Nov 22, 2020Updated 5 years ago
- ☆18Jun 23, 2025Updated 8 months ago
- ☆13Sep 8, 2024Updated last year
- A library for handling Structural Causal Models and performing interventional and counterfactual inference on them.☆13Jul 3, 2020Updated 5 years ago
- 2022级华南师范大学编译原理实验☆13Jun 16, 2024Updated last year
- Scikit-learn vectorizer implementing "A simple but tough-to-beat baseline for sentence embeddings." by Arora, Sanjeev, Yingyu Liang, and …☆12Apr 1, 2018Updated 7 years ago
- A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …☆11Mar 18, 2023Updated 2 years ago
- PyTorch implementation of paper "Evolving Parameterized Prompt Memory for Continual Learning" in AAAI 2024 (Oral).☆14Apr 15, 2024Updated last year
- ☆12Nov 15, 2022Updated 3 years ago
- TPLink IPC Control☆19Jul 24, 2024Updated last year
- 23秋季工程化C程序设计代码仓库,包括lab1-5的实验代码和实验报告,感兴趣的话就点个star吧~☆12Mar 1, 2025Updated last year
- ☆13Jul 2, 2025Updated 8 months ago
- ☆12Jun 29, 2024Updated last year
- ☆11Oct 10, 2021Updated 4 years ago
- Source code of "FlowWalker: A Memory-efficient and High-performance GPU-based Dynamic Graph Random Walk Framework"☆11Oct 23, 2024Updated last year
- ☆12Jul 6, 2023Updated 2 years ago
- ☆16Apr 13, 2025Updated 10 months ago
- Official codes for COLING 2024 paper "Robust and Scalable Model Editing for Large Language Models": https://arxiv.org/abs/2403.17431v1☆14Mar 27, 2024Updated last year
- ☆11Aug 13, 2024Updated last year
- QuoteSum is a textual QA dataset containing Semi-Extractive Multi-source Question Answering (SEMQA) examples written by humans, based on …☆13Mar 25, 2024Updated last year
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"☆12Sep 12, 2023Updated 2 years ago
- [ICLR 2026] BARREL: Boundary-Aware Reasoning for Factual and Reliable LRMs☆17May 21, 2025Updated 9 months ago
- Implementation for <Understanding Robust Overftting of Adversarial Training and Beyond> in ICML'22.☆12Jul 1, 2022Updated 3 years ago
- An implementation for MetGen: A Module-Based Entailment Tree Generation Framework for Answer Explanation.☆13Jul 21, 2022Updated 3 years ago
- ☆11Mar 9, 2022Updated 3 years ago
- The collections of MOE (Mixture Of Expert) papers, code and tools, etc.☆12Mar 15, 2024Updated last year
- ☆13Nov 29, 2021Updated 4 years ago
- ☆20Aug 14, 2025Updated 6 months ago
- ☆14Aug 3, 2024Updated last year