jina-ai/jina-vdr

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jina-ai/jina-vdr)

jina-ai / jina-vdr

Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval

☆38

Alternatives and similar repositories for jina-vdr

Users that are interested in jina-vdr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Mungeryang / colqwen3
View on GitHub
The code used to train and run inference with the ColQwen3 model. Welcome to follow and star! ⭐️⭐️⭐️ https://huggingface.co/goodman2001/…
☆15Jul 4, 2026Updated 3 weeks ago
DunZhang / Jasper-Token-Compression-Training
View on GitHub
The training codes of Jasper-Token-Compression-600M
☆20Nov 19, 2025Updated 8 months ago
VAGOsolutions / sauerkrautlm-colpali
View on GitHub
☆16Mar 1, 2026Updated 4 months ago
SalesforceAIResearch / UniDoc-Bench
View on GitHub
☆38Jun 2, 2026Updated last month
facebookresearch / MetaEmbed
View on GitHub
[ICLR 2026 Oral] Official Implementation of the paper "MetaEmbed: Scaling Multimodal Retrieval at Test-Time with Flexible Late Interactio…
☆18Jul 2, 2026Updated 3 weeks ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
roipony / flash-maxsim
View on GitHub
☆27Jun 11, 2026Updated last month
illuin-tech / modernvbert
View on GitHub
ModernVBERT is a 250M-parameter vision–language encoder that aligns a text-encoder (Ettin-150M) with a vision-encoder (SigLIP2-B) through…
☆16Oct 16, 2025Updated 9 months ago
MMDocRAG / MMDocRAG
View on GitHub
The code used to train and run inference with MMDocRAG
☆21Nov 6, 2025Updated 8 months ago
allenai / DrawEduMath
View on GitHub
Can VLMs understand students' hand-drawn math work?
☆19Jan 20, 2026Updated 6 months ago
360CVGroup / RzenEmbed
View on GitHub
Embedding model prioritized towards Multimodal RAG, overall + VisDoc double top1 on MMEB benchmark
☆36Jun 16, 2026Updated last month
mjeensung / xtr-pytorch
View on GitHub
☆19May 16, 2024Updated 2 years ago
illuin-tech / vidore-benchmark
View on GitHub
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
☆278Mar 25, 2026Updated 4 months ago
gangiswag / cornstack
View on GitHub
☆56Jun 21, 2025Updated last year
CLUEbenchmark / Math24o
View on GitHub
Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark
☆14Mar 27, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Episoode / Double-Bench
View on GitHub
[AAAI-26] Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?
☆31Dec 14, 2025Updated 7 months ago
Debrup-61 / RaDeR
View on GitHub
Official Code Repositiry for "RaDeR: Reasoning-aware Dense Retrieval Models" accepted at Main Conference EMNLP 2025
☆18Jun 23, 2025Updated last year
jina-ai / embedding-fingerprints
View on GitHub
Identify which embedding model produced a vector using digit-level tokenization and a tiny transformer
☆21Mar 7, 2026Updated 4 months ago
jina-ai / submodular-optimization
View on GitHub
Submodular optimization for context engineering: query fan-out, text selection, passage reranking
☆80Jul 14, 2025Updated last year
HanxiangQin / omni-col-press
View on GitHub
A modular framework for training and inference of (compressed) multi-vector retrieval across any modality.
☆22Apr 4, 2026Updated 3 months ago
vincentamato / mlx-esm-2
View on GitHub
An MLX implementation of Meta AI's ESM-2 protein language model
☆16Aug 16, 2025Updated 11 months ago
MinhDucBui / Multi3Hate
View on GitHub
☆15Jan 6, 2025Updated last year
alexmartin1722 / wikivideo
View on GitHub
WikiVideo: Article Generation from Multiple Videos
☆15Nov 14, 2025Updated 8 months ago
LG-AI-EXAONE / KMMLU-Pro
View on GitHub
☆16Aug 18, 2025Updated 11 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
tomsherborne / zx-parse
View on GitHub
Zero-Shot Cross-Lingual Semantic Parsing (Sherborne & Lapata, ACL 2022)
☆17May 16, 2022Updated 4 years ago
modelscope / modelscope-mcp-server
View on GitHub
ModelScope's official MCP Server (in active development).
☆23Dec 15, 2025Updated 7 months ago
whybe-choi / kovidore-benchmark
View on GitHub
[ACL'26 Workshop] KoViDoRe: Korean Visual Document Retrieval Benchmark
☆24Jul 2, 2026Updated 3 weeks ago
reka-ai / rekaquant
View on GitHub
☆63Jul 10, 2025Updated last year
Snarci / RedDino
View on GitHub
☆16Mar 20, 2026Updated 4 months ago
swiss-ai / pretrain-code
View on GitHub
Pretraining codebase for Apertus models, based on Megatron-LM
☆21Sep 25, 2025Updated 10 months ago
shangshang-wang / Resa
View on GitHub
Resa: Transparent Reasoning Models via SAEs
☆50Sep 23, 2025Updated 10 months ago
OpenBMB / OpenAct
View on GitHub
☆16Oct 9, 2025Updated 9 months ago
lightonai / pylate
View on GitHub
Late Interaction Models Training & Retrieval
☆876Updated this week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
apple / ml-reversal-blessing
View on GitHub
☆17Jul 31, 2025Updated 11 months ago
tiiuae / Falcon-H1
View on GitHub
All information and news with respect to Falcon-H1 series
☆122Oct 9, 2025Updated 9 months ago
Tencent-Hunyuan / Hunyuan-4B
View on GitHub
☆16Aug 5, 2025Updated 11 months ago
OpenGVLab / Docopilot
View on GitHub
[CVPR 2025] Docopilot: Improving Multimodal Models for Document-Level Understanding
☆37Jul 22, 2025Updated last year
violetxi / ExpRL
View on GitHub
☆22Jun 16, 2026Updated last month
Tinycompany-AI / SuperTokenizer
View on GitHub
Multi-Word Probabilistic based supertokenizer
☆15May 15, 2025Updated last year
VIM-Bench / VIM_TOOL
View on GitHub
☆12Jun 12, 2024Updated 2 years ago