Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".
β224Dec 16, 2025Updated 3 months ago
Alternatives and similar repositories for retrieval-scaling
Users that are interested in retrieval-scaling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python package for serving a local search engine. One command to download and serve a datastore---that's it π.β25Jun 6, 2025Updated 9 months ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrievalβ195Sep 13, 2025Updated 6 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".β226Jun 24, 2025Updated 9 months ago
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"β20Mar 31, 2025Updated 11 months ago
- FlexAttention w/ FlashAttention3 Supportβ27Oct 5, 2024Updated last year
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Resolving Knowledge Conflicts in Large Language Models, COLM 2024β18Oct 7, 2025Updated 5 months ago
- Retrieval-Augmented Generation battle!β64Mar 22, 2026Updated last week
- β19Nov 4, 2025Updated 4 months ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.β586Updated this week
- Scalable training for dense retrieval models.β298Jun 10, 2025Updated 9 months ago
- [EMNLP 2022] This is the code repo for our EMNLPβ22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contrβ¦β50Oct 12, 2023Updated 2 years ago
- Model implementation for the contextual embeddings projectβ43Jun 2, 2025Updated 9 months ago
- β10Feb 9, 2024Updated 2 years ago
- [ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Trainingβ23Aug 18, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- SILO Language Models code repositoryβ83Feb 23, 2024Updated 2 years ago
- train with kittens!β64Oct 25, 2024Updated last year
- Dense X Retrieval: What Retrieval Granularity Should We Use?β168Jan 8, 2024Updated 2 years ago
- This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Ajiβ¦β241Nov 3, 2023Updated 2 years ago
- Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.β164Oct 4, 2023Updated 2 years ago
- [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"β80Nov 25, 2024Updated last year
- The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)β158Jan 6, 2023Updated 3 years ago
- QLoRA for Masked Language Modelingβ23Sep 11, 2023Updated 2 years ago
- β20May 30, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- code for training & evaluating Contextual Document Embedding modelsβ202May 14, 2025Updated 10 months ago
- ScienceMeter: Tracking Scientific Knowledge Updates in Language Modelsβ17Jun 28, 2025Updated 9 months ago
- Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmarkβ22Aug 22, 2025Updated 7 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.β97Feb 9, 2023Updated 3 years ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.β734Jan 26, 2026Updated 2 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructionsβ53Jul 3, 2024Updated last year
- [COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMsβ64Mar 9, 2026Updated 3 weeks ago
- Generative Representational Instruction Tuningβ689Jun 25, 2025Updated 9 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encodersβ18May 23, 2025Updated 10 months ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Implementation of NAACL'25 "Empowering Retrieval-based Conversational Recommendation with Contrasting User Preferences"β14Sep 9, 2025Updated 6 months ago
- [ACL 2024 Oral] This is the code repo for our ACLβ24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Moβ¦β39Jun 30, 2024Updated last year
- FlexiTokensβ19Dec 27, 2025Updated 3 months ago
- The evaluation framework for training-free sparse attention in LLMsβ122Jan 27, 2026Updated 2 months ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrievalβ61Jun 20, 2024Updated last year
- "Found in the Middle: How Language Models Use Long Contexts Better via Plug-and-Play Positional Encoding" Zhenyu Zhang, Runjin Chen, Shiwβ¦β31May 7, 2024Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"β48Jan 17, 2024Updated 2 years ago