Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
☆261Updated this week
Alternatives and similar repositories for vidore-benchmark
Users that are interested in vidore-benchmark are comparing it to the libraries listed below
Sorting:
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,523Feb 19, 2026Updated last week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆353Jun 2, 2025Updated 8 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆845Jan 28, 2025Updated last year
- Late Interaction Models Training & Retrieval☆721Feb 18, 2026Updated last week
- A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.☆1,599Dec 20, 2025Updated 2 months ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆495Jul 23, 2025Updated 7 months ago
- Benchmarking library for RAG☆260Feb 15, 2026Updated 2 weeks ago
- Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.☆727Jan 26, 2026Updated last month
- Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…☆3,859May 17, 2025Updated 9 months ago
- ☆10Jul 15, 2024Updated last year
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆124Sep 28, 2025Updated 5 months ago
- Recipes for shrinking, optimizing, customizing cutting edge vision models. 💜☆1,883Jan 9, 2026Updated last month
- In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.☆449Feb 13, 2024Updated 2 years ago
- ☆15May 23, 2022Updated 3 years ago
- This repository helps you evaluate your models on the FreshStack benchmark!☆33Dec 9, 2025Updated 2 months ago
- [ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark☆166Oct 14, 2025Updated 4 months ago
- Code release for "SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers" [NeurIPS D&B, 2024]☆74Jan 13, 2025Updated last year
- Creating Generative AI Apps which work☆17Apr 14, 2025Updated 10 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Oct 19, 2024Updated last year
- The Official Repository for "Bring Your Own Data! Self-Supervised Evaluation for Large Language Models"☆107Sep 23, 2023Updated 2 years ago
- Writing Blog Posts with Generative Feedback Loops!☆50Mar 19, 2024Updated last year
- Sphynx Hallucination Induction☆53Jan 31, 2025Updated last year
- MTEB: Massive Text Embedding Benchmark☆3,143Updated this week
- Low-Rank adapter extraction for fine-tuned transformers models☆180May 2, 2024Updated last year
- Parsing-free RAG supported by VLMs☆912Dec 7, 2025Updated 2 months ago
- The prime repository for state-of-the-art Multilingual Question Answering research and development.☆739Sep 18, 2025Updated 5 months ago
- Infinity is a high-throughput, low-latency serving engine for text-embeddings, reranking models, clip, clap and colpali☆2,676Feb 5, 2026Updated 3 weeks ago
- DocBench: A Benchmark for Evaluating LLM-based Document Reading Systems☆68Sep 29, 2024Updated last year
- code for training & evaluating Contextual Document Embedding models☆201May 14, 2025Updated 9 months ago
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆632Jan 11, 2026Updated last month
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆42Oct 12, 2025Updated 4 months ago
- Implementation of "Efficient Multi-vector Dense Retrieval with Bit Vectors", ECIR 2024☆68Oct 21, 2025Updated 4 months ago
- ☆25Jan 30, 2026Updated last month
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"☆15Aug 26, 2025Updated 6 months ago
- Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"☆12Oct 20, 2024Updated last year
- ColBERT: state-of-the-art neural search (SIGIR'20, TACL'21, NeurIPS'21, NAACL'22, CIKM'22, ACL'23, EMNLP'23)☆3,782Oct 14, 2025Updated 4 months ago
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆38Aug 4, 2025Updated 6 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆104Oct 28, 2025Updated 4 months ago
- This repository contains expert evaluation interface and data evaluation script for the OpenScholar project.☆36Nov 19, 2024Updated last year