RulinShao/retrieval-scaling

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RulinShao/retrieval-scaling)

RulinShao / retrieval-scaling

Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".

☆226

Alternatives and similar repositories for retrieval-scaling

Users that are interested in retrieval-scaling are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

RulinShao / massive-serve
View on GitHub
Python package for serving a local search engine. One command to download and serve a datastore---that's it 😎.
☆26Jun 6, 2025Updated last year
xlang-ai / BRIGHT
View on GitHub
[ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
☆210Sep 13, 2025Updated 10 months ago
HansiZeng / scaling-retriever
View on GitHub
[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"
☆22Mar 31, 2025Updated last year
facebookresearch / ReasonIR
View on GitHub
Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".
☆230Jul 2, 2026Updated 3 weeks ago
GindaChen / FlexFlashAttention3
View on GitHub
FlexAttention w/ FlashAttention3 Support
☆27Oct 5, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
yikee / Knowledge_Conflict
View on GitHub
Resolving Knowledge Conflicts in Large Language Models, COLM 2024
☆18Oct 7, 2025Updated 9 months ago
jataware / XRR2
View on GitHub
Expand -> Retrieve -> Rerank - simple method with strong results on BRIGHT benchmark
☆22Aug 22, 2025Updated 11 months ago
castorini / rank_llm
View on GitHub
RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.
☆611Jul 19, 2026Updated last week
formll / resolving-scaling-law-discrepancies
View on GitHub
☆19Nov 4, 2025Updated 8 months ago
facebookresearch / tart
View on GitHub
Code and model release for the paper "Task-aware Retrieval with Instructions" by Asai et al.
☆168Oct 4, 2023Updated 2 years ago
facebookresearch / dpr-scale
View on GitHub
Scalable training for dense retrieval models.
☆298Jul 2, 2026Updated 3 weeks ago
OpenMatch / COCO-DR
View on GitHub
[EMNLP 2022] This is the code repo for our EMNLP‘22 paper "COCO-DR: Combating Distribution Shifts in Zero-Shot Dense Retrieval with Contr…
☆51Oct 12, 2023Updated 2 years ago
yuzhaouoe / pretraining-data-packing
View on GitHub
[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training
☆24Aug 18, 2024Updated last year
ChrisHayduk / QLoRA-for-MLM
View on GitHub
QLoRA for Masked Language Modeling
☆23Sep 11, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
HazyResearch / train-tk
View on GitHub
train with kittens!
☆67Oct 25, 2024Updated last year
kernelmachine / silo-lm
View on GitHub
SILO Language Models code repository
☆83Feb 23, 2024Updated 2 years ago
castorini / ragnarok
View on GitHub
Retrieval-Augmented Generation battle!
☆66Apr 18, 2026Updated 3 months ago
seanmacavaney / plaidrepro
View on GitHub
☆11Feb 9, 2024Updated 2 years ago
chentong0 / factoid-wiki
View on GitHub
Dense X Retrieval: What Retrieval Granularity Should We Use?
☆171Jan 8, 2024Updated 2 years ago
swj0419 / detect-pretrain-code
View on GitHub
This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Aji…
☆243Nov 3, 2023Updated 2 years ago
HKUNLP / STRING
View on GitHub
[ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"
☆82Nov 25, 2024Updated last year
facebookresearch / NPM
View on GitHub
The original implementation of Min et al. "Nonparametric Masked Language Modeling" (paper https//arxiv.org/abs/2212.01349)
☆159Jan 6, 2023Updated 3 years ago
jlscheerer / xtr-warp
View on GitHub
XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.
☆211May 3, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
huggingface / olm-training
View on GitHub
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆98Feb 9, 2023Updated 3 years ago
jxmorris12 / cde
View on GitHub
code for training & evaluating Contextual Document Embedding models
☆207May 14, 2025Updated last year
lightonai / fast-plaid
View on GitHub
High-Performance Engine for Multi-Vector Search
☆271May 28, 2026Updated 2 months ago
StarTrail-org / RAG-DS-Serve
View on GitHub
[AAAI26]: DS SERVE: The Largest Open Vector Store over Pretain Data; A Framework for Efficient and Scalable Neural Retrieval
☆53Jan 28, 2026Updated 6 months ago
Leooyii / LCEG
View on GitHub
[COLM'25] A Controlled Study on Long Context Extension and Generalization in LLMs
☆65Mar 9, 2026Updated 4 months ago
texttron / tevatron
View on GitHub
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
☆743Jul 18, 2026Updated last week
ContextualAI / gritlm
View on GitHub
Generative Representational Instruction Tuning
☆697Jun 25, 2025Updated last year
orionw / FollowIR
View on GitHub
FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions
☆56Jul 3, 2024Updated 2 years ago
OpenMatch / MARVEL
View on GitHub
[ACL 2024 Oral] This is the code repo for our ACL‘24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Mo…
☆39Jun 30, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kookeej / CORAL
View on GitHub
Implementation of NAACL'25 "Empowering Retrieval-based Conversational Recommendation with Contrasting User Preferences"
☆14Sep 9, 2025Updated 10 months ago
srush / tangent
View on GitHub
Source-to-Source Debuggable Derivatives in Pure Python
☆15Jan 23, 2024Updated 2 years ago
sher222 / LeReT
View on GitHub
Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval
☆52Oct 31, 2024Updated last year
OpenMatch / ANCE-Tele
View on GitHub
Code and data of the EMNLP 2022 Main Conference paper "Reduce Catastrophic Forgetting of Dense Retrieval Training with Teleportation Nega…
☆18Mar 25, 2024Updated 2 years ago
google-deepmind / loft
View on GitHub
LOFT: A 1 Million+ Token Long-Context Benchmark
☆237Apr 13, 2026Updated 3 months ago
ielab / llm-rankers
View on GitHub
Document Ranking with Large Language Models.
☆210Feb 14, 2026Updated 5 months ago
RulinShao / LightSeq
View on GitHub
Official repository for DistFlashAttn: Distributed Memory-efficient Attention for Long-context LLMs Training
☆223Aug 19, 2024Updated last year