Alibaba-NLP / VRAGLinks
Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning"
☆411Updated last month
Alternatives and similar repositories for VRAG
Users that are interested in VRAG are comparing it to the libraries listed below
Sorting:
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆401Updated 7 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆360Updated 3 months ago
- R1-onevision, a visual language model capable of deep CoT reasoning.☆572Updated 7 months ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆663Updated 4 months ago
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆260Updated 4 months ago
- The development and future prospects of large multimodal reasoning models.☆557Updated 4 months ago
- Collect every awesome work about r1!☆425Updated 7 months ago
- Agentic RAG R1 Framework via Reinforcement Learning☆351Updated this week
- A Survey on Multimodal Retrieval-Augmented Generation☆438Updated last month
- Parsing-free RAG supported by VLMs☆873Updated 3 weeks ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆229Updated this week
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆668Updated last month
- MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources☆211Updated 2 months ago
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆614Updated 5 months ago
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆1,025Updated 2 weeks ago
- ☆1,016Updated 3 weeks ago
- ☆441Updated 2 months ago
- Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision