illuin-tech / vidore-benchmark
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
☆259 Updated 2 weeks ago
Alternatives and similar repositories for vidore-benchmark
Users who are interested in vidore-benchmark are comparing it to the libraries listed below.
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳 ☆352 Updated 8 months ago
- ☆147 Updated last year
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆237 Updated 4 months ago
- Banishing LLM Hallucinations Requires Rethinking Generalization ☆277 Updated last year
- ☆120 Updated last year
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs". ☆241 Updated last year
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆249 Updated last year
- awesome synthetic (text) datasets ☆321 Updated 3 weeks ago
- (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation ☆95 Updated 2 months ago
- ☆82 Updated 2 months ago
- Attribute (or cite) statements generated by LLMs back to in-context information. ☆319 Updated last year
- Official repo for "Make Your LLM Fully Utilize the Context" ☆263 Updated last year
- Manage scalable open LLM inference endpoints in Slurm clusters ☆280 Updated last year
- Repository for “PlanRAG: A Plan-then-Retrieval Augmented Generation for Generative Large Language Models as Decision Makers”, NAACL24 ☆151 Updated last year
- The first dense retrieval model that can be prompted like an LM ☆90 Updated 8 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆80 Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task… ☆184 Updated last year
- RAGElo is a set of tools that helps you select the best RAG-based LLM agents by using an Elo ranker ☆126 Updated 3 months ago
- Complex Function Calling Benchmark. ☆163 Updated last year
- Official repository for the paper "ReasonIR: Training Retrievers for Reasoning Tasks". ☆217 Updated 7 months ago
- Code for explaining and evaluating late chunking (chunked pooling) ☆487 Updated last year
- ☆161 Updated last year
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners" ☆120 Updated 3 months ago
- MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents [EMNLP 2024] ☆194 Updated 5 months ago
- [EMNLP 2024] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs. ☆147 Updated last year
- XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR. ☆181 Updated 9 months ago
- Beating the GAIA benchmark with Transformers Agents. ☆145 Updated 11 months ago
- Simple UI for debugging correlations of text embeddings ☆305 Updated 8 months ago
- Simple examples using Argilla tools to build AI ☆57 Updated last year
- Baguetter is a flexible, efficient, and hackable search engine library implemented in Python. It's designed for quickly benchmarking, imp… ☆202 Updated last year