[ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models
☆61Mar 9, 2026Updated last month
Alternatives and similar repositories for 500xCompressor
Users that are interested in 500xCompressor are comparing it to the libraries listed below.
- Accommodating Large Language Model Training over Heterogeneous Environment.☆28Mar 13, 2025Updated last year
- [EMNLP 2024] CompAct: Compressing Retrieved Documents Actively for Question Answering☆38Sep 20, 2024Updated last year
- [NAACL 2025 Main Selected Oral] Repository for the paper: Prompt Compression for Large Language Models: A Survey☆37May 18, 2025Updated 10 months ago
- The repo for In-context Autoencoder☆168May 11, 2024Updated last year
- ☆18Dec 2, 2024Updated last year
- A Mechanistic‑Interpretability study that finds the structural dynamics of Large Language Models under fine‑tuning.☆16May 30, 2025Updated 10 months ago
- Compression for Foundation Models☆35Jul 21, 2025Updated 8 months ago
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆41Oct 17, 2023Updated 2 years ago
- [NeurIPS 2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆174Jul 4, 2024Updated last year
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304☆14Oct 11, 2022Updated 3 years ago
- ☆20Aug 14, 2025Updated 8 months ago
- Code for "Retaining Key Information under High Compression Rates: Query-Guided Compressor for LLMs" (ACL 2024)☆19Jun 12, 2024Updated last year
- Official Repo for FoodieQA paper (EMNLP 2024)☆20Jun 26, 2025Updated 9 months ago
- ☆12Nov 15, 2022Updated 3 years ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33May 29, 2024Updated last year
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆227May 31, 2025Updated 10 months ago
- docker:dind with NVIDIA GPU support via NVIDIA container toolkit☆13Apr 1, 2026Updated 2 weeks ago
- ☆53Oct 29, 2024Updated last year
- [ACL 2024] Long-Context Language Modeling with Parallel Encodings☆170Jun 13, 2024Updated last year
- The evaluation framework for training-free sparse attention in LLMs☆122Jan 27, 2026Updated 2 months ago
- ☆20Oct 31, 2022Updated 3 years ago
- Better Transition-Based AMR Parsing with a Refined Search Space (authors' DyNet implementation for the EMNLP18 paper)☆10Jun 13, 2019Updated 6 years ago
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆24May 27, 2025Updated 10 months ago
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆16Sep 15, 2024Updated last year
- Multi-Stage Vision Token Dropping: Towards Efficient Multimodal Large Language Model☆36Jan 8, 2025Updated last year
- A Python package that enables the creation and parsing of structured prompts for language models in markdown format☆16Jan 9, 2026Updated 3 months ago
- CrewAI AgentOps: Monitor your AI Agents☆19Jun 29, 2024Updated last year
- A slacking-off helper for 雨课堂 (Rain Classroom) online classes☆23Mar 16, 2026Updated last month
- This repository provides the official PyTorch implementation for the AAAI 2026 Oral paper "Inductive Generative Recommendation via Retrie…☆37Jan 15, 2026Updated 3 months ago
- Research work aimed at addressing the problem of modeling infinite-length context☆48Dec 18, 2025Updated 3 months ago
- Code for the EACL 2024 paper: "Small Language Models Improve Giants by Rewriting Their Outputs"☆12Apr 20, 2024Updated last year
- Leveraging passage embeddings for efficient listwise reranking with large language models.☆51Dec 7, 2024Updated last year
- A reimplementation of KOSMOS-1 from "Language Is Not All You Need: Aligning Perception with Language Models"☆27Mar 3, 2023Updated 3 years ago
- An extension to the GaLore paper that performs Natural Gradient Descent in a low-rank subspace☆18Oct 21, 2024Updated last year
- Spectral Graph Attention Network with Fast Eigen-approximation☆11Dec 24, 2021Updated 4 years ago
- Official implementation of Hierarchical Context Merging: Better Long Context Understanding for Pre-trained LLMs (ICLR 2024).☆43Aug 6, 2024Updated last year
- Learning to Compress Prompts with Gist Tokens - https://arxiv.org/abs/2304.08467☆316Feb 14, 2025Updated last year
- ☆17Feb 20, 2023Updated 3 years ago
- The official implementation of paper: SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction.☆50Oct 18, 2024Updated last year