VITA-Group/LoCoCo

[ICML 2024] "LoCoCo: Dropping In Convolutions for Long Context Compression", Ruisi Cai, Yuandong Tian, Zhangyang Wang, Beidi Chen

☆17

Alternatives and similar repositories for LoCoCo

Users who are interested in LoCoCo are comparing it to the libraries listed below.

huangyuxiang03 / Locret
☆14 · Oct 3, 2024 · Updated last year
DAMO-NLP-SG / LongPO
[ICLR 2025] LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization
☆43 · Feb 27, 2025 · Updated last year
Hritikbansal / jpo
☆13 · Jul 2, 2025 · Updated last year
thunlp / APB
Official Implementation of APB (ACL 2025 main Oral) and Spava (ACL 2026 main).
☆37 · Apr 6, 2026 · Updated 3 months ago
keven980716 / weak-to-strong-deception
[ICLR 2025] Code & Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"
☆15 · Jun 21, 2024 · Updated 2 years ago
thunlp / NOSA
The official implementation of NOSA
☆19 · Jun 11, 2026 · Updated last month
zeyuliu1037 / LMUFormer
ICLR 2024 LMUFormer: Low Complexity Yet Powerful Spiking Model With Legendre Memory Units
☆13 · Sep 20, 2024 · Updated last year
luciferkonn / DT_Mem
This is the official code base for the paper "Think Before You Act: Decision Transformers with Internal Working Memory"
☆23 · Jul 12, 2024 · Updated 2 years ago
UChi-JCL / CacheGen
☆168 · Oct 9, 2024 · Updated last year
tau-nlp / zero_scrolls
Running inference on the ZeroSCROLLS benchmark
☆22 · Apr 18, 2024 · Updated 2 years ago
LuJunru / MemoChat
MemoChat: Tuning LLMs to Use Memos for Consistent Long-Range Open-Domain Conversation
☆29 · Apr 18, 2024 · Updated 2 years ago
VITA-Group / READ-ME
[NeurIPS 2024] "Read-ME: Refactorizing LLMs as Router-Decoupled Mixture of Experts with System Co-Design", Ruisi Cai, Yeonju Ro, Geon-Woo …
☆16 · Dec 16, 2024 · Updated last year
messi84 / Multiple-Adversarial_Examples_attack
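The rise and fall of the Six Dynasties is like a dream; time slips by, and the passing months startle. Though the year is cold and the road is long, this resolve will be hard to take away.
☆11 · Mar 15, 2020 · Updated 6 years ago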
tumaer / sph-hae
[GSI 2023] Learning Lagrangian Fluid Mechanics with E(3)-Equivariant GNNs
☆15 · Jun 3, 2024 · Updated 2 years ago
ant-research / long-context-modeling
Research work aimed at addressing the problem of modeling infinite-length context
☆50 · Dec 18, 2025 · Updated 7 months ago
Doraemonzzz / nanoTransNormer
☆11 · Oct 11, 2023 · Updated 2 years ago
webis-de / set-encoder
Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders
☆19 · May 23, 2025 · Updated last year
xAlg-ai / HashAttention-1.0
☆18 · Sep 23, 2025 · Updated 10 months ago
fasa-org / dash-attention
DashAttention: Differentiable and Adaptive Sparse Hierarchical Attention
☆22 · May 25, 2026 · Updated last month
VITA-Group / Random-Shuffling-BackdoorDetect
[NeurIPS 2022] "Randomized Channel Shuffling: Minimal-Overhead Backdoor Attack Detection without Clean Datasets" by Ruisi Cai*, Zhenyu Zh…
☆21 · Oct 1, 2022 · Updated 3 years ago
schwartz-lab-NLP / TOVA
Token Omission Via Attention
☆131 · Oct 13, 2024 · Updated last year
nightdessert / Retrieval_Head
Open-source code for the paper "Retrieval Head Mechanistically Explains Long-Context Factuality"
☆241 · Aug 2, 2024 · Updated last year
whyNLP / LCKV
Layer-Condensed KV cache w/ 10 times larger batch size, fewer params and less computation. Dramatic speed up with better task performance…
☆157 · Apr 7, 2025 · Updated last year
kaistAI / knowledge-reasoning
[EMNLP 2024] Official implementation of "Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Ut…
☆23 · Dec 4, 2024 · Updated last year
ThisisBillhe / ZipCache
[NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification
☆33 · Mar 30, 2025 · Updated last year
lfsszd / CS-Drafting
Cascade Speculative Drafting
☆33 · Apr 2, 2024 · Updated 2 years ago
VITA-Group / R-Sparse
[ICLR'25] R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference
☆21 · Apr 28, 2025 · Updated last year
KuaiSearchPERKS / PERKS
KuaiSearch PERKS
☆12 · Nov 16, 2021 · Updated 4 years ago
FYYFU / HeadKV
[ICLR 2025] Code and data for paper: Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasonin…
☆45 · Mar 10, 2025 · Updated last year
robclark / egl_dma_buf
Simple dmabuf EGLImage example
☆10 · Sep 18, 2014 · Updated 11 years ago
bentherien / mu_learned_optimization
[Poster; ICLR 2026] [Oral; NeurIPS OPT2024] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers
☆16 · Apr 15, 2026 · Updated 3 months ago
jedisct1 / simpira384
An AES-based 384-bit permutation.
☆21 · May 3, 2025 · Updated last year
aneeshan95 / Cross-modal_Hierarchy_FGSBIR
Cross-modal Hierarchical Modelling for FGSBIR. Work accepted for Oral presentation at BMVC 2020
☆18 · Sep 8, 2023 · Updated 2 years ago
alessiodevoto / l2compress
Code for the EMNLP24 paper "A simple and effective L2 norm based method for KV Cache compression."
☆19 · Dec 13, 2024 · Updated last year
Bulat-Ziganshin / ECC-Benchmark
Comparison of leading error-correcting code implementations
☆12 · Aug 19, 2022 · Updated 3 years ago
gtolias / mkd
MATLAB implementation of the multiple-kernel local-patch descriptor (BMVC 2017 paper)
☆14 · Jan 31, 2018 · Updated 8 years ago
lasr-spelling / sae-spelling
Code for the paper "A is for Absorption: Studying Feature Splitting and Absorption in Sparse Autoencoders"
☆15 · Dec 28, 2025 · Updated 6 months ago
mit-gfx / recfilter
Domain-specific language for IIR filters
☆17 · Sep 28, 2016 · Updated 9 years ago
leibovit / Sparse-Linear-Networks
Code to accompany the paper "Sparse Linear Networks with a Fixed Butterfly Structure: Theory and Practice"
☆10 · Aug 10, 2021 · Updated 4 years ago