RulinShao / retrieval-scalingView external linksLinks
Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".
β224Dec 16, 2025Updated 2 months ago
Alternatives and similar repositories for retrieval-scaling
Users that are interested in retrieval-scaling are comparing it to the libraries listed below
Sorting:
- Python package for serving a local search engine. One command to download and serve a datastore---that's it π.β25Jun 6, 2025Updated 8 months ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrievalβ189Sep 13, 2025Updated 5 months ago
- FlexAttention w/ FlashAttention3 Supportβ27Oct 5, 2024Updated last year
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".β218Jun 24, 2025Updated 7 months ago
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"β19Mar 31, 2025Updated 10 months ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"β78Nov 25, 2024Updated last year
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024β18Oct 7, 2025Updated 4 months ago
- train with kittens!β63Oct 25, 2024Updated last year
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.β576Updated this week
- Retrieval-Augmented Generation battle!β62Jul 31, 2025Updated 6 months ago
- β20Nov 4, 2025Updated 3 months ago
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Ajiβ¦β240Nov 3, 2023Updated 2 years ago
- Model implementation for the contextual embeddings projectβ40Jun 2, 2025Updated 8 months ago
- code for training & evaluating Contextual Document Embedding modelsβ202May 14, 2025Updated 9 months ago
- QLoRA for Masked Language Modelingβ22Sep 11, 2023Updated 2 years ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.β96Feb 9, 2023Updated 3 years ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Trainingβ23Aug 18, 2024Updated last year
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encodersβ18May 23, 2025Updated 8 months ago
- Code for ICLR 2025 Paper "What is Wrong with Perplexity for Long-context Language Modeling?"β109Oct 11, 2025Updated 4 months ago
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"β48Jan 17, 2024Updated 2 years ago
- Memory layers use a trainable key-value lookup mechanism to add extra parameters to a model without increasing FLOPs. Conceptually, sparsβ¦β371Dec 12, 2024Updated last year
- minimal pytorch implementation of bm25 (with sparse tensors)β104Oct 28, 2025Updated 3 months ago
- Source-to-Source Debuggable Derivatives in Pure Pythonβ15Jan 23, 2024Updated 2 years ago
- "Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" Zhenyu Zhang, Runjin Chen, Shiwβ¦β31May 7, 2024Updated last year
- β19Mar 25, 2025Updated 10 months ago
- Parallel Associative Scan for Language Modelsβ18Jan 8, 2024Updated 2 years ago
- β316Jun 21, 2024Updated last year
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our focβ¦β32Jun 13, 2024Updated last year
- Scalable training for dense retrieval models.β298Jun 10, 2025Updated 8 months ago
- Official repository for DistFlashAttn: Distributed Memory-efficient Attention for Long-context LLMs Trainingβ222Aug 19, 2024Updated last year
- A framework to study AI models in Reasoning, Alignment, and use of Memory (RAM).β344Dec 16, 2025Updated 2 months ago
- Generative Representational Instruction Tuningβ686Jun 25, 2025Updated 7 months ago
- [EMNLP 2022] This is the code repo for our EMNLPβ22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrβ¦β50Oct 12, 2023Updated 2 years ago
- Document Ranking with Large Language Models.β202Updated this week
- β20May 30, 2024Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?β168Jan 8, 2024Updated 2 years ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrievalβ60Jun 20, 2024Updated last year
- Understand and test language model architectures on synthetic tasks.β252Jan 12, 2026Updated last month
- Code for exploring Based models from "Simple linear attention language models balance the recall-throughput tradeoff"β248Jun 6, 2025Updated 8 months ago