On the Theoretical Limitations of Embedding-Based Retrieval
☆650Sep 15, 2025Updated 8 months ago
Alternatives and similar repositories for limit
Users that are interested in limit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"☆20Mar 31, 2025Updated last year
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 3 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆19May 23, 2025Updated 11 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆205May 3, 2025Updated last year
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆52Jan 6, 2026Updated 4 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Provides a common interface to many IR ranking datasets.☆390Apr 10, 2026Updated last month
- Official repository of the Seismic library.☆124Apr 8, 2026Updated last month
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆64Jun 20, 2024Updated last year
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 4 years ago
- ☆18Aug 21, 2025Updated 9 months ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol☆371Aug 27, 2025Updated 8 months ago
- Unified Learned Sparse Retrieval Framework☆68May 13, 2024Updated 2 years ago
- Implement a reasoning LLM in PyTorch from scratch, step by step☆4,346Apr 21, 2026Updated last month
- Late Interaction Models Training & Retrieval☆811May 11, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [SIGIR 2024] The official repo for paper "Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous …☆31Apr 24, 2024Updated 2 years ago
- Reached #13 on Stanford's Terminal Bench leaderboard. Orchestrator, explorer & coder agents working together with intelligent context sha…☆1,375Nov 3, 2025Updated 6 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆225Dec 16, 2025Updated 5 months ago
- The github repository of paper "Understanding Differential Search Index for Text Retrieval" in ACL2023 Findings..☆16May 21, 2023Updated 3 years ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆227May 6, 2026Updated 2 weeks ago
- Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration☆15Jun 4, 2024Updated last year
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆594Updated this week
- code for training & evaluating Contextual Document Embedding models☆205May 14, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- ☆19May 16, 2024Updated 2 years ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆200Sep 13, 2025Updated 8 months ago
- Semantic search and document parsing tools for the command line☆1,793Mar 11, 2026Updated 2 months ago
- An MCP server exposing full Chrome DevTools Protocol debugging: breakpoints, step/run, call stacks, eval, and source maps.☆343Oct 2, 2025Updated 7 months ago
- Fast BM25 search in Python, powered by Numpy and Numba☆1,674Updated this week
- The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval☆28Jun 7, 2023Updated 2 years ago
- LangGraph template for a simple ReAct agent, with MCP tools support and robust test suites.☆556Sep 28, 2025Updated 7 months ago
- ☆22Sep 26, 2024Updated last year
- Tree-based indexes for neural-search☆33Mar 4, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- LATTICE turns retrieval into an LLM-driven navigation problem over a semantic scaffold☆37Mar 9, 2026Updated 2 months ago
- The first dense retrieval model that can be prompted like an LM☆92May 8, 2025Updated last year
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆54Jul 3, 2024Updated last year
- ☆39Nov 21, 2022Updated 3 years ago
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆16Sep 4, 2025Updated 8 months ago
- High performance implementation of the WARP (SIGIR'25) retrieval engine.☆33Apr 28, 2026Updated 3 weeks ago