Alibaba-NLP / VRAGLinks
Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforcement Learning"
☆246Updated this week
Alternatives and similar repositories for VRAG
Users that are interested in VRAG are comparing it to the libraries listed below
Sorting:
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆341Updated 2 months ago
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆205Updated last week
- MMR1: Advancing the Frontiers of Multimodal Reasoning☆160Updated 3 months ago
- Awesome-RAG-Vision: a curated list of advanced retrieval augmented generation (RAG) for Computer Vision☆179Updated 3 weeks ago
- Agentic RAG R1 Framework via Reinforcement Learning☆215Updated last month
- ☆242Updated last month
- ☆241Updated 2 weeks ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆183Updated last week
- Collect every awesome work about r1!☆388Updated last month
- ☆547Updated this week
- Awesome Agent Training☆164Updated this week
- Scaling Deep Research via Reinforcement Learning in Real-world Environments.☆461Updated 2 months ago
- ☆85Updated last year
- Valley is a cutting-edge multimodal large model designed to handle a variety of tasks involving text, images, and video data.☆242Updated 4 months ago
- MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval☆189Updated last month
- ☆174Updated 4 months ago
- ☆145Updated 5 months ago
- ✨✨R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning☆149Updated last month
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆172Updated 2 months ago
- Qwen DianJin: LLMs for the Financial Industry by Alibaba Cloud☆113Updated last month
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆126Updated 7 months ago
- ☆152Updated last month
- R1-searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning☆566Updated last month
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆125Updated last month
- Search, organize, discover anything!☆49Updated last year
- Agent-R1: Training Powerful LLM Agents with End-to-End Reinforcement Learning☆573Updated last month
- ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆500Updated 2 weeks ago
- Official implementation for "ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization"☆78Updated last month
- A Survey on Multimodal Retrieval-Augmented Generation☆231Updated 3 weeks ago
- GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents☆256Updated this week