[ICML‘2024] "LoCoCo: Dropping In Convolutions for Long Context Compression", Ruisi Cai, Yuandong Tian, Zhangyang Wang, Beidi Chen
☆17Sep 7, 2024Updated last year
Alternatives and similar repositories for LoCoCo
Users that are interested in LoCoCo are comparing it to the libraries listed below
Sorting:
- [GSI 2023] Learning Lagrangian Fluid Mechanics with E(3)-Equivariant GNNs☆15Jun 3, 2024Updated last year
- ☆13Jul 2, 2025Updated 8 months ago
- Latent Large Language Models☆19Aug 24, 2024Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 9 months ago
- ☆14Oct 3, 2024Updated last year
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆14Jun 21, 2024Updated last year
- Official codebase for "Analyzing the Generalization and Reliability of Steering Vectors"☆19Dec 14, 2024Updated last year
- Cross-modal Hierarchical Modelling for FGSBIR. Work accepted for Oral presentation in BMVC 2020☆18Sep 8, 2023Updated 2 years ago
- Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."☆18Dec 13, 2024Updated last year
- [EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…☆23Dec 4, 2024Updated last year
- Official Implementation of "DeCoRe: Decoding by Contrasting Retrieval Heads to Mitigate Hallucination"☆28Dec 18, 2024Updated last year
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization☆29Sep 12, 2024Updated last year
- NeurIPS 2021 | Fine-Grained Neural Network Explanation by Identifying Input Features with Predictive Information☆34Dec 13, 2021Updated 4 years ago
- UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models☆52Nov 29, 2025Updated 3 months ago
- Rust bindings to https://github.com/leejet/stable-diffusion.cpp☆37Updated this week
- [NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification☆31Mar 30, 2025Updated 11 months ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Dec 19, 2023Updated 2 years ago
- ☆13Oct 5, 2025Updated 4 months ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers.☆11Aug 30, 2024Updated last year
- Official Implementation of APB (ACL 2025 main Oral) and Spava.☆34Jan 30, 2026Updated last month
- Official software repository of the kANNolo library.☆48Jan 23, 2026Updated last month
- FGVCLib is an open-source and well documented library for Fine-grained Visual Classification.☆40Dec 9, 2023Updated 2 years ago
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization"☆82Apr 12, 2024Updated last year
- open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality☆232Aug 2, 2024Updated last year
- Token Omission Via Attention☆127Oct 13, 2024Updated last year
- ☆11Dec 23, 2024Updated last year
- ☆150Oct 9, 2024Updated last year
- Concurrency library☆17Oct 13, 2024Updated last year
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models".☆12Oct 14, 2024Updated last year
- Build an AI bot in Discord to serve user's personalized reports on what's up in tech☆28Sep 14, 2025Updated 5 months ago
- Code used in the 2023 Alert Geomaterials doctoral school on Machine in Geomechanics☆13Oct 2, 2023Updated 2 years ago
- Repository with code supporting PNAS article☆11Jun 6, 2023Updated 2 years ago
- This repository contains the Parasol processor, which enables next-generation privacy preserving applications. Users can run arbitrary co…☆11Feb 25, 2026Updated last week
- Official Implementation for [ICLR26] DefensiveKV: Taming the Fragility of KV Cache Eviction in LLM Inference☆22Feb 9, 2026Updated 3 weeks ago
- Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance…☆156Apr 7, 2025Updated 10 months ago
- [ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization☆43Feb 27, 2025Updated last year
- Material parsers and other tools, scripts Initially developed for Grobid Superconductor☆13Feb 21, 2025Updated last year
- my profile readme☆14Updated this week
- Models for packages and the resources they contain.☆14Mar 10, 2024Updated last year