On the Theoretical Limitations of Embedding-Based Retrieval
☆648Sep 15, 2025Updated 9 months ago
Alternatives and similar repositories for limit
Users that are interested in limit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"☆22Mar 31, 2025Updated last year
- ArtGallery website using Django.☆13Nov 25, 2023Updated 2 years ago
- Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders☆19May 23, 2025Updated last year
- [CVPR 2026] 🔥🔥 Official Repo of USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning☆1,228Sep 12, 2025Updated 9 months ago
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.☆209May 3, 2025Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval☆52Jan 6, 2026Updated 5 months ago
- Provides a common interface to many IR ranking datasets.☆390May 28, 2026Updated last month
- Official repository of the Seismic library.☆131Apr 8, 2026Updated 2 months ago
- XTR: Rethinking the Role of Token Retrieval in Multi-Vector Retrieval☆65Jun 20, 2024Updated 2 years ago
- MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers☆488Oct 7, 2025Updated 8 months ago
- The offcial repository for 'CharacterBERT and Self-Teaching for Improving the Robustness of Dense Retrievers on Queries with Typos', SIGI…☆16May 4, 2022Updated 4 years ago
- Anemoi: A Semi-Centralized Multi-agent Systems Based on Agent-to-Agent Communication MCP server from Coral Protocol☆370Aug 27, 2025Updated 10 months ago
- Unified Learned Sparse Retrieval Framework☆68May 13, 2024Updated 2 years ago
- An open source web crawler that searches the internet☆264Sep 6, 2025Updated 9 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Late Interaction Models Training & Retrieval☆859Updated this week
- Implement a reasoning LLM in PyTorch from scratch, step by step☆4,573Jun 12, 2026Updated 2 weeks ago
- [SIGIR 2024] The official repo for paper "Planning Ahead in Generative Retrieval: Guiding Autoregressive Generation through Simultaneous …☆32Apr 24, 2024Updated 2 years ago
- [SIGIR'24] Generative Retrieval as Multi-Vector Dense Retrieval☆36Oct 18, 2024Updated last year
- Reached #13 on Stanford's Terminal Bench leaderboard. Orchestrator, explorer & coder agents working together with intelligent context sha…☆1,389Nov 3, 2025Updated 7 months ago
- SPRINT Toolkit helps you evaluate diverse neural sparse models easily using a single click on any IR dataset.☆48Jul 25, 2023Updated 2 years ago
- Official repository for "Scaling Retrieval-Based Langauge Models with a Trillion-Token Datastore".☆226Dec 16, 2025Updated 6 months ago
- The github repository of paper "Understanding Differential Search Index for Text Retrieval" in ACL2023 Findings..☆16May 21, 2023Updated 3 years ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks".☆230Jun 20, 2026Updated last week
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Cocktail: A Comprehensive Information Retrieval Benchmark with LLM-Generated Documents Integration☆15Jun 4, 2024Updated 2 years ago
- RankLLM is a Python toolkit for reproducible information retrieval research using rerankers, with a focus on listwise reranking.☆608Jun 19, 2026Updated last week
- MCP-Universe is a comprehensive framework designed for RL training, benchmarking, and developing AI agents for general tool-use.☆590Jun 23, 2026Updated last week
- code for training & evaluating Contextual Document Embedding models☆206May 14, 2025Updated last year
- ☆19May 16, 2024Updated 2 years ago
- [ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval☆204Sep 13, 2025Updated 9 months ago
- Semantic search and document parsing tools for the command line☆1,828Mar 11, 2026Updated 3 months ago
- ☆22Jul 11, 2025Updated 11 months ago
- An MCP server exposing full Chrome DevTools Protocol debugging: breakpoints, step/run, call stacks, eval, and source maps.☆345Oct 2, 2025Updated 8 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Fast BM25 search in Python, powered by Numpy and Numba☆1,715Jun 11, 2026Updated 2 weeks ago
- The official repo for our SIGIR'23 Full paper: Constructing Tree-based Index for Efficient and Effective Dense Retrieval☆28Jun 7, 2023Updated 3 years ago
- LangGraph template for a simple ReAct agent, with MCP tools support and robust test suites.☆559Sep 28, 2025Updated 9 months ago
- An open-source implementation of Whisper☆491Oct 29, 2025Updated 8 months ago
- LATTICE turns retrieval into an LLM-driven navigation problem over a semantic scaffold☆37Mar 9, 2026Updated 3 months ago
- The first dense retrieval model that can be prompted like an LM☆93May 8, 2025Updated last year
- ☆39Nov 21, 2022Updated 3 years ago