On the Theoretical Limitations of Embedding-Based Retrieval
☆648Sep 15, 2025Updated 7 months ago
Alternatives and similar repositories for limit
Users that are interested in limit are comparing it to the libraries listed below.
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"☆20Mar 31, 2025Updated last year
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 2 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 11 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆199May 3, 2025Updated 11 months ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆52Jan 6, 2026Updated 3 months ago
- Provides a common interface to many IR ranking datasets.☆389Apr 10, 2026Updated 3 weeks ago
- Official repository of the Seismic library.☆118Apr 8, 2026Updated 3 weeks ago
- MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers☆475Oct 7, 2025Updated 6 months ago
- The official repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 3 years ago
- ☆18Aug 21, 2025Updated 8 months ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol☆372Aug 27, 2025Updated 8 months ago
- Unified Learned Sparse Retrieval Framework☆68May 13, 2024Updated last year
- An open source web crawler that searches the internet☆259Sep 6, 2025Updated 7 months ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆63Jun 20, 2024Updated last year
- Late Interaction Models Training & Retrieval☆796Updated this week
- Implement a reasoning LLM in PyTorch from scratch, step by step☆4,262Apr 21, 2026Updated last week
- [SIGIR 2024] The official repo for paper "Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous …☆31Apr 24, 2024Updated 2 years ago
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- Official repository for "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore".☆225Dec 16, 2025Updated 4 months ago
- The GitHub repository for the paper "Understanding Differential Search Index for Text Retrieval" in ACL 2023 Findings.☆16May 21, 2023Updated 2 years ago
- Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks".☆227Apr 8, 2026Updated 3 weeks ago
- Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration☆15Jun 4, 2024Updated last year
- MCP-Universe is a comprehensive framework designed for RL training, benchmarking, and developing AI agents for general tool-use.☆582Apr 20, 2026Updated last week
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆592Updated this week
- code for training & evaluating Contextual Document Embedding models☆203May 14, 2025Updated 11 months ago
- ☆19May 16, 2024Updated last year
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆198Sep 13, 2025Updated 7 months ago
- Semantic search and document parsing tools for the command line☆1,775Mar 11, 2026Updated last month
- Fast BM25 search in Python, powered by Numpy and Numba☆1,648Updated this week
- The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval☆28Jun 7, 2023Updated 2 years ago
- Tree-based indexes for neural-search☆33Mar 4, 2024Updated 2 years ago
- An open-source implementation of Whisper☆487Oct 29, 2025Updated 6 months ago
- LATTICE turns retrieval into an LLM-driven navigation problem over a semantic scaffold☆35Mar 9, 2026Updated last month
- The first dense retrieval model that can be prompted like an LM☆92May 8, 2025Updated 11 months ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆54Jul 3, 2024Updated last year
- ☆39Nov 21, 2022Updated 3 years ago
- High performance implementation of the WARP (SIGIR'25) retrieval engine.☆28Apr 21, 2026Updated last week