On the Theoretical Limitations of Embedding-Based Retrieval
☆642Sep 15, 2025Updated 6 months ago
Alternatives and similar repositories for limit
Users that are interested in limit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"☆20Mar 31, 2025Updated last year
- [ICLR 26] The official code repository for the paper "Mirage or Method? How Model–Task Alignment Induces Divergent RL Conclusions".☆17Feb 9, 2026Updated 2 months ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆18May 23, 2025Updated 10 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆189May 3, 2025Updated 11 months ago
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆52Jan 6, 2026Updated 3 months ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Provides a common interface to many IR ranking datasets.☆386Feb 20, 2026Updated last month
- MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers☆465Oct 7, 2025Updated 6 months ago
- Official repository of the Seismic library.☆117Mar 25, 2026Updated 2 weeks ago
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 3 years ago
- ☆18Aug 21, 2025Updated 7 months ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol☆373Aug 27, 2025Updated 7 months ago
- An open source web crawler that searches the internet☆255Sep 6, 2025Updated 7 months ago
- Unified Learned Sparse Retrieval Framework☆68May 13, 2024Updated last year
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆61Jun 20, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Late Interaction Models Training & Retrieval☆783Mar 6, 2026Updated last month
- Implement a reasoning LLM in PyTorch from scratch, step by step☆4,087Updated this week
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- [SIGIR 2024] The official repo for paper "Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous …☆31Apr 24, 2024Updated last year
- Reached #13 on Stanford's Terminal Bench leaderboard. Orchestrator, explorer & coder agents working together with intelligent context sha…☆1,363Nov 3, 2025Updated 5 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆47Jul 25, 2023Updated 2 years ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆225Dec 16, 2025Updated 3 months ago
- The github repository of paper "Understanding Differential Search Index for Text Retrieval" in ACL2023 Findings..☆16May 21, 2023Updated 2 years ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆225Jun 24, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- MCP-Universe is a comprehensive framework designed for RL training, benchmarking, and developing AI agents for general tool-use.☆577Mar 25, 2026Updated 2 weeks ago
- Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration☆15Jun 4, 2024Updated last year
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆586Apr 4, 2026Updated last week
- code for training & evaluating Contextual Document Embedding models☆203May 14, 2025Updated 10 months ago
- ☆19May 16, 2024Updated last year
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆197Sep 13, 2025Updated 6 months ago
- Semantic search and document parsing tools for the command line☆1,770Mar 11, 2026Updated last month
- LangGraph template for a simple ReAct agent, with MCP tools support and robust test suites.☆554Sep 28, 2025Updated 6 months ago
- Fast BM25 search in Python, powered by Numpy and Numba☆1,615Updated this week
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval☆28Jun 7, 2023Updated 2 years ago
- An open-source implementation of Whisper☆488Oct 29, 2025Updated 5 months ago
- Tree-based indexes for neural-search☆32Mar 4, 2024Updated 2 years ago
- LATTICE turns retrieval into an LLM-driven navigation problem over a semantic scaffold☆35Mar 9, 2026Updated last month
- The first dense retrieval model that can be prompted like an LM☆91May 8, 2025Updated 11 months ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated last year
- FollowIR: Evaluating and Teaching Information Retrieval Models to Follow Instructions☆53Jul 3, 2024Updated last year