Hannibal046/xRAG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Hannibal046/xRAG)

Hannibal046 / xRAG

[Neurips2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token

☆184

Alternatives and similar repositories for xRAG

Users that are interested in xRAG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

carriex / recomp
View on GitHub
RECOMP: Improving Retrieval-Augmented LMs with Compression and Selective Augmentation.
☆148Jan 6, 2026Updated 6 months ago
yanhong-lbh / text_or_pixels
View on GitHub
Codebase for EMNLP 2025 Findings paper "Text or Pixels? Evaluating Efficiency and Understanding of LLMs with Visual Text Inputs"
☆19Nov 14, 2025Updated 8 months ago
Workday / cpc
View on GitHub
☆26Jan 16, 2025Updated last year
getao / icae
View on GitHub
The repo for In-context Autoencoder
☆174May 11, 2024Updated 2 years ago
dtunai / Griffin-Jax
View on GitHub
Jax implementation of "Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models"
☆15May 10, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
eunseongc / MGFiD_NAACL2024F
View on GitHub
The official repository for MGFiD (NAACL 2024 Findings)
☆15Jul 27, 2024Updated last year
ZongqianLi / Prompt-Compression-Survey
View on GitHub
[NAACL 2025 Main Selected Oral] Repository for the paper: Prompt Compression for Large Language Models: A Survey
☆36May 18, 2025Updated last year
Hannibal046 / SelfMemory
View on GitHub
[Neurips2023] Source code for Lift Yourself Up: Retrieval-augmented Text Generation with Self Memory
☆62May 24, 2023Updated 3 years ago
HansiZeng / scaling-retriever
View on GitHub
[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"
☆22Mar 31, 2025Updated last year
oriyor / ret-robust
View on GitHub
Implementation of the paper: "Making Retrieval-Augmented Language Models Robust to Irrelevant Context"
☆77Aug 6, 2024Updated last year
princeton-nlp / AutoCompressors
View on GitHub
[EMNLP 2023] Adapting Language Models to Compress Long Contexts
☆337Sep 9, 2024Updated last year
GSYfate / knnlm-limits
View on GitHub
Official code repo for paper "Great Memory, Shallow Reasoning: Limits of kNN-LMs"
☆24Apr 30, 2025Updated last year
swj0419 / in-context-pretraining
View on GitHub
☆57Apr 11, 2024Updated 2 years ago
zorazrw / filco
View on GitHub
[Preprint] Learning to Filter Context for Retrieval-Augmented Generaton
☆198Apr 6, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
yale-nlp / InstruSum
View on GitHub
☆23Feb 26, 2024Updated 2 years ago
dengc2023 / LongDocURL
View on GitHub
☆42Apr 6, 2026Updated 3 months ago
ZongqianLi / 500xCompressor
View on GitHub
[ACL 2025 Main] Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models
☆64Mar 9, 2026Updated 4 months ago
lfy79001 / RegHNT
View on GitHub
Code for COLING 2022 long paper: Answering Numerical Reasoning Questions in Table-Text Hybrid Contents with Graph-based Encoder and Tree-…
☆22Dec 15, 2022Updated 3 years ago
HKUNLP / ChunkLlama
View on GitHub
[ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"
☆450Oct 16, 2024Updated last year
liuqi6777 / pe_rank
View on GitHub
Leveraging passage embeddings for efficient listwise reranking with large language models.
☆51Dec 7, 2024Updated last year
dongguanting / DPA-RAG
View on GitHub
The code and data of DPA-RAG, accepted by WWW 2025 main conference.
☆68Oct 23, 2025Updated 9 months ago
runchu-tian / LongPiBench
View on GitHub
The repository for papaer "Distance between Relevant Information Pieces Causes Bias in Long-Context LLMs"
☆14Dec 16, 2024Updated last year
kookeej / CORAL
View on GitHub
Implementation of NAACL'25 "Empowering Retrieval-based Conversational Recommendation with Contrasting User Preferences"
☆14Sep 9, 2025Updated 10 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
weizhepei / InstructRAG
View on GitHub
[ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales
☆151Apr 26, 2026Updated 3 months ago
ielab / PromptReps
View on GitHub
Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval
☆52Jan 6, 2026Updated 6 months ago
princeton-nlp / CEPE
View on GitHub
[ACL 2024] Long-Context Language Modeling with Parallel Encodings
☆169Jun 13, 2024Updated 2 years ago
alkaidpku / DQ-ToolQA
View on GitHub
☆10Nov 15, 2023Updated 2 years ago
naver / bergen
View on GitHub
Benchmarking library for RAG
☆276Jul 14, 2026Updated last week
sunnweiwei / MAIR
View on GitHub
MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. Evaluate your retrieval models on 126 diverse tasks. [EMNLP 2024]
☆28Nov 3, 2024Updated last year
AkariAsai / self-rag
View on GitHub
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…
☆2,410May 25, 2024Updated 2 years ago
ContextualAI / gritlm
View on GitHub
Generative Representational Instruction Tuning
☆697Jun 25, 2025Updated last year
evalplus / repoqa
View on GitHub
RepoQA: Evaluating Long-Context Code Understanding
☆136Nov 1, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
spcl / MRAG
View on GitHub
Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs"
☆242Feb 26, 2026Updated 5 months ago
dwzhu-pku / LongEmbed
View on GitHub
LongEmbed: Extending Embedding Models for Long Context Retrieval (EMNLP 2024)
☆148Nov 9, 2024Updated last year
xsc1234 / INFO-RAG
View on GitHub
Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation
☆57Dec 25, 2024Updated last year
zhichaoxu-shufe / RankMamba
View on GitHub
☆18Mar 30, 2024Updated 2 years ago
Leooyii / LCEG
View on GitHub
[COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs
☆65Mar 9, 2026Updated 4 months ago
facebookresearch / atlas
View on GitHub
Code repository for supporting the paper "Atlas Few-shot Learning with Retrieval Augmented Language Models",(https//arxiv.org/abs/2208.03…
☆560Jul 2, 2026Updated 3 weeks ago
wjn1996 / InstructGraph
View on GitHub
A framework to empover LLMs on graph reasoning and generation. Refer to our paper: https://arxiv.org/pdf/2402.08785.pdf
☆79Jul 29, 2024Updated last year