[ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models
☆63Mar 9, 2026Updated 3 months ago
Alternatives and similar repositories for 500xCompressor
Users that are interested in 500xCompressor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [EMNLP 2024] CompAct: Compressing Retrieved Documents Actively for Question Answering☆37Sep 20, 2024Updated last year
- Accommodating Large Language Model Training over Heterogeneous Environment.☆31Mar 13, 2025Updated last year
- [NAACL 2025 Main Selected Oral] Repository for the paper: Prompt Compression for Large Language Models: A Survey☆37May 18, 2025Updated last year
- ☆18Dec 2, 2024Updated last year
- A Mechanistic‑Interpretability study that finds the structural dynamics of Large Language Models under fine‑tuning.☆17May 30, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Official implementation of the paper "Pretraining Language Models to Ponder in Continuous Space"☆26Jul 21, 2025Updated 10 months ago
- FocusLLM: Scaling LLM’s Context by Parallel Decoding☆45Dec 8, 2024Updated last year
- Compression for Foundation Models☆36Jul 21, 2025Updated 10 months ago
- This repository includes the code implementation of the paper Improving Pacing in Long-Form Story Planning by Yichen Wang, Kevin Yang, Xi…☆17Nov 19, 2024Updated last year
- [NeurIPS 2023] Repetition In Repetition Out: Towards Understanding Neural Text Degeneration from the Data Perspective☆41Oct 17, 2023Updated 2 years ago
- [Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token☆181Jul 4, 2024Updated last year
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304☆14Oct 11, 2022Updated 3 years ago
- ☆40Jul 24, 2025Updated 10 months ago
- [EMNLP 2024 Findings] Unlocking Continual Learning Abilities in Language Models☆26Oct 8, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆20Aug 14, 2025Updated 10 months ago
- Landing page for MIB: A Mechanistic Interpretability Benchmark☆25Aug 15, 2025Updated 10 months ago
- Official Repo for FoodieQA paper (EMNLP 2024)☆20Jun 26, 2025Updated 11 months ago
- KV cache compression via sparse coding☆18Oct 26, 2025Updated 7 months ago
- ☆14Nov 15, 2022Updated 3 years ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆33May 29, 2024Updated 2 years ago
- docker:dind with NVIDIA GPU support via NVIDIA container toolkit☆14Jun 2, 2026Updated last week
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- [NeurIPS 2025] Simple extension on vLLM to help you speed up reasoning model without training.☆230May 31, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆13Jul 14, 2024Updated last year
- official github repository of Subdora python package☆12Nov 13, 2024Updated last year
- 基于 Spring Boot 的 BOSS 直聘职位信息爬虫系统,提供自动化的职位信息采集和数据处理功能。系统采用现代化的技术栈,包括 Spring Boot 框架、SQLite 数据库和 RESTful API 设计,实现了智能的反爬虫策略和高效的数据解析能力。该系统可以…☆29Mar 16, 2025Updated last year
- AGiXT is a dynamic AI Automation Platform that seamlessly orchestrates instruction management and complex task execution across diverse A…☆25Jan 26, 2026Updated 4 months ago
- The evaluation framework for training-free sparse attention in LLMs☆123Jan 27, 2026Updated 4 months ago
- ☆21Oct 31, 2022Updated 3 years ago
- Better Transition-Based AMR Parsing with a Refined Search Space (authors' DyNet implementation for the EMNLP18 paper)☆10Jun 13, 2019Updated 7 years ago
- MUA-RL: MULTI-TURN USER-INTERACTING AGENT REINFORCEMENT LEARNING FOR AGENTIC TOOL USE☆62Nov 5, 2025Updated 7 months ago
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆26May 27, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Code repo for "CritiPrefill: A Segment-wise Criticality-based Approach for Prefilling Acceleration in LLMs".☆17Sep 15, 2024Updated last year
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"☆15Aug 26, 2025Updated 9 months ago
- Procgen2: A community maintained fork of procgen☆12Aug 25, 2022Updated 3 years ago
- Flutter embedder for Tizen☆14Updated this week
- 雨课堂线上课划水小助手☆23May 26, 2026Updated 3 weeks ago
- MATN, SIGIR 2020☆21Jun 16, 2022Updated 4 years ago
- This repository provides the official PyTorch implementation for the AAAI 2026 Oral paper "Inductive Generative Recommendation via Retrie…☆43Jan 15, 2026Updated 5 months ago