On the Theoretical Limitations of Embedding-Based Retrieval
☆630Sep 15, 2025Updated 5 months ago
Alternatives and similar repositories for limit
Users that are interested in limit are comparing it to the libraries listed below
Sorting:
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"☆20Mar 31, 2025Updated 11 months ago
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆15Feb 9, 2026Updated 3 weeks ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 9 months ago
- Unified Learned Sparse Retrieval Framework☆68May 13, 2024Updated last year
- MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers☆450Oct 7, 2025Updated 4 months ago
- ☆18Aug 21, 2025Updated 6 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆224Dec 16, 2025Updated 2 months ago
- Official software repository of S. Bruch, F. M. Nardini, C. Rulli, and R. Venturini. "Efficient Inverted Indexes for Approximate Retrieva…☆105Jan 27, 2026Updated last month
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆64Aug 2, 2024Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆185May 3, 2025Updated 9 months ago
- Provides a common interface to many IR ranking datasets.☆381Feb 20, 2026Updated last week
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆52Jan 6, 2026Updated last month
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆190Sep 13, 2025Updated 5 months ago
- code for training & evaluating Contextual Document Embedding models☆201May 14, 2025Updated 9 months ago
- Late Interaction Models Training & Retrieval☆732Updated this week
- Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration☆15Jun 4, 2024Updated last year
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆15Sep 4, 2025Updated 5 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆218Jun 24, 2025Updated 8 months ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆61Jun 20, 2024Updated last year
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆579Updated this week
- ☆60Jan 12, 2026Updated last month
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆31Updated this week
- [CVPR 2025] Official Implementation of LOCORE: Image Re-ranking with Long-Context☆15Apr 15, 2025Updated 10 months ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol☆372Aug 27, 2025Updated 6 months ago
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆52Jul 3, 2024Updated last year
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- Reached #13 on Stanford's Terminal Bench leaderboard. Orchestrator, explorer & coder agents working together with intelligent context sha…☆1,332Nov 3, 2025Updated 3 months ago
- Implement a reasoning LLM in PyTorch from scratch, step by step☆3,211Updated this week
- Scalable training for dense retrieval models.☆298Jun 10, 2025Updated 8 months ago
- Inquisitive Parrots for Search☆199Jun 5, 2025Updated 8 months ago
- Checkpoint-engine is a simple middleware to update model weights in LLM inference engines☆912Updated this week
- Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy☆1,500Feb 17, 2026Updated last week
- ☆19May 16, 2024Updated last year
- A large-scale information-rich web dataset, featuring millions of real clicked query-document labels☆346Dec 16, 2024Updated last year
- ☆34May 14, 2025Updated 9 months ago
- PyTorch code for "ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning"☆21Oct 28, 2024Updated last year
- ☆43Apr 22, 2025Updated 10 months ago
- The first dense retrieval model that can be prompted like an LM☆90May 8, 2025Updated 9 months ago