illuin-tech/vidore-benchmark

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/illuin-tech/vidore-benchmark)

illuin-tech / vidore-benchmark

Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.

☆278

Alternatives and similar repositories for vidore-benchmark

Users that are interested in vidore-benchmark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

illuin-tech / colpali
View on GitHub
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
☆2,706Jul 13, 2026Updated last week
tonywu71 / colpali-cookbooks
View on GitHub
Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻‍🍳
☆357Jun 2, 2025Updated last year
AnswerDotAI / byaldi
View on GitHub
Use late-interaction multi-modal models such as ColPali in just a few lines of code.
☆851Jan 28, 2025Updated last year
jina-ai / jina-vdr
View on GitHub
Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval
☆38Aug 4, 2025Updated 11 months ago
adithya-s-k / VARAG
View on GitHub
Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine
☆497Jul 23, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
mayubo2333 / MMLongBench-Doc
View on GitHub
Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations
☆149Sep 28, 2025Updated 9 months ago
lightonai / pylate
View on GitHub
Late Interaction Models Training & Retrieval
☆876Updated this week
AnswerDotAI / rerankers
View on GitHub
A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.
☆1,626Dec 20, 2025Updated 7 months ago
Mungeryang / colqwen3
View on GitHub
The code used to train and run inference with the ColQwen3 model. Welcome to follow and star! ⭐️⭐️⭐️ https://huggingface.co/goodman2001/…
☆15Jul 4, 2026Updated 2 weeks ago
texttron / tevatron
View on GitHub
Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.
☆742Updated this week
ChenyuHeidiZhang / VL-commonsense
View on GitHub
☆14May 23, 2022Updated 4 years ago
naver / bergen
View on GitHub
Benchmarking library for RAG
☆276Jul 14, 2026Updated last week
AnswerDotAI / RAGatouille
View on GitHub
Easily use and train state of the art late-interaction retrieval methods (ColBERT) in any RAG pipeline. Designed for modularity and ease-…
☆3,943May 17, 2025Updated last year
VAGOsolutions / sauerkrautlm-colpali
View on GitHub
☆16Mar 1, 2026Updated 4 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
xlang-ai / BRIGHT
View on GitHub
[ICLR 2025] BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
☆206Sep 13, 2025Updated 10 months ago
lightonai / fast-plaid
View on GitHub
High-Performance Engine for Multi-Vector Search
☆271May 28, 2026Updated last month
AIR-Bench / AIR-Bench
View on GitHub
[ACL 2025] AIR-Bench: Automated Heterogeneous Information Retrieval Benchmark
☆167Mar 29, 2026Updated 3 months ago
Alibaba-NLP / ViDoRAG
View on GitHub
[EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents
☆669Jan 11, 2026Updated 6 months ago
KarelDO / xmc.dspy
View on GitHub
In-Context Learning for eXtreme Multi-Label Classification (XMC) using only a handful of examples.
☆457Feb 13, 2024Updated 2 years ago
jxmorris12 / cde
View on GitHub
code for training & evaluating Contextual Document Embedding models
☆207May 14, 2025Updated last year
haon-chen / mmE5
View on GitHub
☆59Feb 27, 2025Updated last year
TIGER-AI-Lab / StructLM
View on GitHub
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆76Oct 19, 2024Updated last year
jlscheerer / xtr-warp
View on GitHub
XTR/WARP (SIGIR'25) is an extremely fast and accurate retrieval engine based on Stanford's ColBERTv2/PLAID and Google DeepMind's XTR.
☆209May 3, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
UpstageAI / evalverse-IFEval
View on GitHub
Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…
☆15May 4, 2024Updated 2 years ago
elicit / fave-dataset
View on GitHub
Paper dataset for "Factored Verification: Detecting and Reducing Hallucination in Summaries of Academic Papers"
☆13Oct 20, 2024Updated last year
OpenBMB / VisRAG
View on GitHub
Parsing-free RAG supported by VLMs
☆972Jul 17, 2026Updated last week
texttron / AgentIR
View on GitHub
AgentIR is a retriever specialized for Deep Research agents.
☆62Apr 16, 2026Updated 3 months ago
MananSuri27 / VisDoM
View on GitHub
☆45Jul 28, 2025Updated 11 months ago
microsoft / echo-rl
View on GitHub
☆55May 26, 2026Updated last month
fsndzomga / open_source_lrm
View on GitHub
☆10Oct 24, 2024Updated last year
amitakamath / vl_text_encoders_are_bottlenecks
View on GitHub
Code and datasets for "Text encoders are performance bottlenecks in contrastive vision-language models". Coming soon!
☆11May 24, 2023Updated 3 years ago
DataArcTech / RagVL
View on GitHub
Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …
☆92Nov 15, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
thongnt99 / lsr-multimodal
View on GitHub
ECIR 2024: Sparse lexical representation for image-text retrieval
☆13Jul 8, 2024Updated 2 years ago
primeqa / primeqa
View on GitHub
The prime repository for state-of-the-art Multilingual Question Answering research and development.
☆740Sep 18, 2025Updated 10 months ago
jina-ai / correlations
View on GitHub
Simple UI for debugging correlations of text embeddings
☆315May 28, 2025Updated last year
vespa-engine / pyvespa
View on GitHub
Python API for https://vespa.ai, the open big data serving engine
☆171Updated this week
bloomberg / m3docrag
View on GitHub
☆71May 19, 2025Updated last year
ielab / llm-rankers
View on GitHub
Document Ranking with Large Language Models.
☆210Feb 14, 2026Updated 5 months ago
yejinc00 / PREMIR
View on GitHub
[EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"
☆15Aug 26, 2025Updated 10 months ago