ZongqianLi/500xCompressor

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ZongqianLi/500xCompressor)

ZongqianLi / 500xCompressor

[ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models

☆64

Alternatives and similar repositories for 500xCompressor

Users that are interested in 500xCompressor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ZongqianLi / Prompt-Compression-Survey
View on GitHub
[NAACL 2025 Main Selected Oral] Repository for the paper: Prompt Compression for Large Language Models: A Survey
☆36May 18, 2025Updated last year
getao / icae
View on GitHub
The repo for In-context Autoencoder
☆174May 11, 2024Updated 2 years ago
dmis-lab / CompAct
View on GitHub
[EMNLP 2024] CompAct: Compressing Retrieved Documents Actively for Question Answering
☆37Sep 20, 2024Updated last year
YichenZW / Pacing
View on GitHub
This repository includes the code implementation of the paper Improving Pacing in Long-Form Story Planning by Yichen Wang, Kevin Yang, Xi…
☆19Nov 19, 2024Updated last year
yurakuratov / hidden_capacity
View on GitHub
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity (ACL 2025, oral)
☆35Jun 14, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
leezythu / FocusLLM
View on GitHub
FocusLLM: Scaling LLM’s Context by Parallel Decoding
☆45Dec 8, 2024Updated last year
gmftbyGMFTBY / Rep-Dropout
View on GitHub
[NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective
☆41Oct 17, 2023Updated 2 years ago
czx-li / DP2O
View on GitHub
Accompanying repo for the DP2O paper accepted by AAAI 2024 main conference
☆17Mar 28, 2024Updated 2 years ago
Hannibal046 / xRAG
View on GitHub
[Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token
☆184Jul 4, 2024Updated 2 years ago
allenai / hyperdecoders
View on GitHub
Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304
☆14Oct 11, 2022Updated 3 years ago
wenyudu / MIGU
View on GitHub
[EMNLP 2024 Findings] Unlocking Continual Learning Abilities in Language Models
☆26Oct 8, 2024Updated last year
wutaiqiang / MI
View on GitHub
Official code for paper "Revisiting Model Interpolation for Efficient Reasoning"
☆17Jul 14, 2026Updated last week
XMUDeepLIT / QGC
View on GitHub
Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)
☆20Jun 12, 2024Updated 2 years ago
aaronmueller / MIB
View on GitHub
Landing page for MIB: A Mechanistic Interpretability Benchmark
☆26Aug 15, 2025Updated 11 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
eric-mitchell / concord
View on GitHub
☆14Nov 15, 2022Updated 3 years ago
krafton-ai / lexico
View on GitHub
KV cache compression via sparse coding
☆17Oct 26, 2025Updated 9 months ago
zhaochenyang20 / Prompt2Model-Self-Guide
View on GitHub
SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper
☆34May 29, 2024Updated 2 years ago
MILVLG / twigvlm
View on GitHub
Implementation of ICCV 2025 paper "Growing a Twig to Accelerate Large Vision-Language Models".
☆30May 23, 2026Updated 2 months ago
hao-ai-lab / Dynasor
View on GitHub
[NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.
☆232May 31, 2025Updated last year
chijames / KERPLE
View on GitHub
☆20Oct 25, 2022Updated 3 years ago
Kaffaljidhmah2 / SpecDec_pp
View on GitHub
Repository for the COLM 2025 paper SpecDec++: Boosting Speculative Decoding via Adaptive Candidate Lengths
☆19Jul 10, 2025Updated last year
wdlctc / mini-s
View on GitHub
☆51Oct 29, 2024Updated last year
THUDM / GLM-iprompt
View on GitHub
Apply Iprompt on GLM with innovative new methods. Currently support Chinese QA, English QA and Chinese poem generation.
☆20Jun 16, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
princeton-pli / QRHead
View on GitHub
QRHead: Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking
☆40Jan 20, 2026Updated 6 months ago
dmg-illc / JUDGE-BENCH
View on GitHub
☆40Jul 24, 2025Updated last year
keeeeenw / TinyLlama
View on GitHub
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
☆14Mar 30, 2024Updated 2 years ago
Extrality / nvidia-dind
View on GitHub
docker:dind with NVIDIA GPU support via NVIDIA container toolkit
☆14Jul 1, 2026Updated 3 weeks ago
cambridgeltl / ECNMT
View on GitHub
Emergent Communication Pretraining for Few-Shot Machine Translation
☆13Dec 3, 2020Updated 5 years ago
hsajjad / Interpretability-Tutorial-NAACL2021
View on GitHub
☆24Jun 7, 2021Updated 5 years ago
fal-ai-community / alphabet-dataset
View on GitHub
Synthetic Alphabet Dataset
☆19Mar 27, 2025Updated last year
ZZZhr-1 / Robust_GUI_Grounding
View on GitHub
On the Robustness of GUI Grounding Models Against Image Attacks
☆12Apr 8, 2025Updated last year
Hao-Ning / MEIDTM-Instance-Dependent-Label-Noise-Learning-with-Manifold-Regularized-Transition-Matrix-Estimatio
View on GitHub
pytorch
☆10Apr 13, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
adithya-s-k / MoLE
View on GitHub
Mixture of Lora Experts
☆11Apr 7, 2024Updated 2 years ago
liuting20 / MustDrop
View on GitHub
Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model
☆36Jan 8, 2025Updated last year
google-research / vmf_embeddings
View on GitHub
☆10Oct 12, 2021Updated 4 years ago
66RING / CritiPrefill
View on GitHub
Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".
☆17Sep 15, 2024Updated last year
sunjie279 / SimCT-
View on GitHub
☆21May 22, 2026Updated 2 months ago
codecaution / EvoMoE
View on GitHub
☆21Oct 31, 2022Updated 3 years ago
Workday / cpc
View on GitHub
☆26Jan 16, 2025Updated last year