Alibaba-NLP / VRAGLinks
Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning"
☆332Updated 2 months ago
Alternatives and similar repositories for VRAG
Users that are interested in VRAG are comparing it to the libraries listed below
Sorting:
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆374Updated 4 months ago
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆312Updated 3 weeks ago
- R1-onevision, a visual language model capable of deep CoT reasoning.☆566Updated 5 months ago
- Agentic Foundation Platform☆471Updated this week
- Collect every awesome work about r1!☆416Updated 4 months ago
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆223Updated last month
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414☆336Updated 3 weeks ago
- Agentic RAG R1 Framework via Reinforcement Learning☆292Updated this week
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆349Updated this week
- [ACL 2025 Oral] 🔥🔥 MegaPairs: Massive Data Synthesis for Universal Multimodal Retrieval☆223Updated 3 months ago
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆215Updated 3 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆221Updated 3 months ago
- MMR1: Advancing the Frontiers of Multimodal Reasoning☆163Updated 6 months ago
- ☆336Updated 3 months ago
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆596Updated 5 months ago
- The development and future prospects of multimodal reasoning models.☆491Updated last month
- 🔧Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning☆253Updated last week
- Awesome Agent Training☆225Updated 2 weeks ago
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆633Updated last month
- ☆369Updated 7 months ago
- PC Agent: While You Sleep, AI Works - A Cognitive Journey into Digital World☆282Updated 3 months ago
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆540Updated 3 months ago
- ☆804Updated 2 weeks ago
- Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision☆227Updated 3 weeks ago
- ☆90Updated last year
- ZO2 (Zeroth-Order Offloading): Full Parameter Fine-Tuning 175B LLMs with 18GB GPU Memory☆183Updated 2 months ago
- Parsing-free RAG supported by VLMs☆783Updated 7 months ago
- A Survey on Multimodal Retrieval-Augmented Generation☆353Updated 3 weeks ago
- Awesome-Large-Search-Models is a collection of papers and resources (Methods, Datasets and other resources) about awesome agentic search …☆119Updated 2 weeks ago
- [2025-上海人工智能实验室书生实训营十佳、优秀项目]☆34Updated last month