Alibaba-NLP/ViDoRAG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Alibaba-NLP/ViDoRAG)

Alibaba-NLP / ViDoRAG

[EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents

☆668

Alternatives and similar repositories for ViDoRAG

Users that are interested in ViDoRAG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Alibaba-NLP / VRAG
View on GitHub
Multimodal Retrieval-augmented Generation Framework Built by Tongyi Lab, Alibaba Group.
☆968Apr 29, 2026Updated 2 months ago
OpenBMB / VisRAG
View on GitHub
Parsing-free RAG supported by VLMs
☆970Updated this week
microsoft / PIKE-RAG
View on GitHub
PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation
☆2,446Sep 10, 2025Updated 10 months ago
ictnlp / FlexRAG
View on GitHub
FlexRAG: A RAG Framework for Information Retrieval and Generation.
☆238Jun 30, 2026Updated 3 weeks ago
illuin-tech / colpali
View on GitHub
The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.
☆2,704Jul 13, 2026Updated last week
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
aiming-lab / MDocAgent
View on GitHub
MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding
☆352Aug 8, 2025Updated 11 months ago
Episoode / Double-Bench
View on GitHub
[AAAI-26] Are We on the Right Way for Assessing Document Retrieval-Augmented Generation?
☆31Dec 14, 2025Updated 7 months ago
Alibaba-NLP / MaskSearch
View on GitHub
Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"
☆155May 27, 2025Updated last year
Alibaba-NLP / ZeroSearch
View on GitHub
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
☆1,305Aug 16, 2025Updated 11 months ago
zilliztech / deep-searcher
View on GitHub
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
☆8,009Nov 19, 2025Updated 8 months ago
Omaralsaabi / M3DOCRAG
View on GitHub
An implementation of "M3DOCRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding" by Jaemin Cho, Debanj…
☆56Nov 13, 2024Updated last year
nttmdlab-nlp / VDocRAG
View on GitHub
[CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents
☆66May 26, 2025Updated last year
illuin-tech / vidore-benchmark
View on GitHub
Vision Document Retrieval (ViDoRe): Benchmark. Evaluation code for the ColPali paper.
☆278Mar 25, 2026Updated 3 months ago
HKUDS / VideoRAG
View on GitHub
[KDD'2026] "VideoRAG: Chat with Your Videos"
☆3,191Mar 18, 2026Updated 4 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
DataArcTech / RagVL
View on GitHub
Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …
☆92Nov 15, 2024Updated last year
KRLabsOrg / LettuceDetect
View on GitHub
Span-level grounding verification for RAG, code, and tool-grounded AI outputs.
☆585Updated this week
wang-qiuchen / PseDet
View on GitHub
[ICLR 2025] PseDet: Revisiting the Power of Pseudo Label in Incremental Object Detection
☆23Sep 16, 2025Updated 10 months ago
Ji-Cather / GraphAgent
View on GitHub
Code for ACL25-findings. An LLM-based agent simulation framework that simulates human behavior and generates dynamic, text-based social g…
☆96Mar 15, 2026Updated 4 months ago
OpenSPG / KAG
View on GitHub
KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…
☆8,917Jan 28, 2026Updated 5 months ago
Alibaba-NLP / OmniSearch
View on GitHub
Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent
☆429Apr 22, 2025Updated last year
tablegpt / tablegpt-agent
View on GitHub
A pre-built agent for TableGPT2.
☆638Jun 24, 2026Updated 3 weeks ago
Alibaba-NLP / DeepResearch
View on GitHub
Tongyi Deep Research, the Leading Open-source Deep Research Agent
☆19,691Feb 27, 2026Updated 4 months ago
llm-lab-org / Multimodal-RAG-Survey
View on GitHub
A Survey on Multimodal Retrieval-Augmented Generation
☆532Feb 20, 2026Updated 5 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
InternLM / MindSearch
View on GitHub
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
☆6,892Jul 4, 2025Updated last year
bloomberg / m3docrag
View on GitHub
☆71May 19, 2025Updated last year
Terry-Xu-666 / NodeRAG
View on GitHub
The official repository of NodeRAG
☆416Mar 19, 2025Updated last year
microsoft / KBLaM
View on GitHub
Official Implementation of "KBLaM: Knowledge Base augmented Language Model"
☆1,450Jul 2, 2026Updated 2 weeks ago
HKUDS / MiniRAG
View on GitHub
[ACL2026] "MiniRAG: Making RAG Simpler with Small and Open-Sourced Language Models"
☆1,980Oct 16, 2025Updated 9 months ago
MoonshotAI / MoBA
View on GitHub
MoBA: Mixture of Block Attention for Long-Context LLMs
☆2,150Apr 3, 2025Updated last year
Alibaba-NLP / CHRONOS
View on GitHub
Repo for NAACL 2025 Paper "Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization"
☆298Aug 4, 2025Updated 11 months ago
bytedance / Valley
View on GitHub
Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, video, and audio data.
☆287May 8, 2026Updated 2 months ago
MMDocRAG / MMDocRAG
View on GitHub
The code used to train and run inference with MMDocRAG
☆21Nov 6, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
OpenDataBox / ST-Raptor
View on GitHub
LLM-Powered Semi-Structured Table Question Answering
☆309Apr 3, 2026Updated 3 months ago
rag-web-ui / rag-web-ui
View on GitHub
RAG Web UI is an intelligent dialogue system based on RAG (Retrieval-Augmented Generation) technology.
☆3,072Apr 6, 2026Updated 3 months ago
om-ai-lab / VLM-R1
View on GitHub
Solve Visual Understanding with Reinforced VLMs
☆6,010Jul 7, 2026Updated 2 weeks ago
thunlp / Migician
View on GitHub
[ACL2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models
☆90May 20, 2025Updated last year
QwenLM / Qwen-Agent
View on GitHub
Agent framework and applications built upon Qwen>=3.0, featuring Function Calling, MCP, Code Interpreter, RAG, Chrome extension, etc.
☆16,821Mar 4, 2026Updated 4 months ago
allenai / olmocr
View on GitHub
Toolkit for linearizing PDFs for LLM datasets/training
☆19,134Mar 25, 2026Updated 3 months ago
OpenBMB / UltraRAG
View on GitHub
A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines
☆5,654Updated this week