aiming-lab / MDocAgent
MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding
☆229Updated last month
Alternatives and similar repositories for MDocAgent
Users who are interested in MDocAgent are comparing it to the libraries listed below.
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆352Updated 3 months ago
- Agentic RAG R1 Framework via Reinforcement Learning☆302Updated 2 weeks ago
- (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆87Updated 3 months ago
- FlexRAG: A RAG Framework for Information Retrieval and Generation.☆222Updated 3 months ago
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆383Updated 5 months ago
- ComoRAG is a Retrieval-Augmented Generation (RAG) system for long documents and multi-document QA, information extraction, and knowledge …☆272Updated last month
- Repo for "MaskSearch: A Universal Pre-Training Framework to Enhance Agentic Search Capability"☆146Updated 4 months ago
- Awesome Deep Research list! For more details, please refer to our survey paper -- A Comprehensive Survey of Deep Research: Systems, Metho…☆330Updated last month
- An implementation of "M3DOCRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding" by Jaemin Cho, Debanj…☆44Updated 10 months ago
- Recursive RAG with R1 reasoning☆328Updated 4 months ago
- Deep Research Agent CognitiveKernel-Pro from Tencent AI Lab. Paper: https://arxiv.org/pdf/2508.00414☆353Updated last month
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆262Updated last month
- GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation☆380Updated last week
- Parsing-free RAG supported by VLMs☆799Updated 7 months ago
- [Preprint] DeepSieve: Information Sieving via LLM-as-a-Knowledge-Router☆99Updated this week
- Improves the RAG pipeline on table data☆106Updated 11 months ago
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆229Updated 5 months ago
- An intelligent dataset construction and evaluation platform for training multimodal large models☆124Updated last week
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆565Updated 3 months ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆244Updated last week
- Official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"☆154Updated last year
- StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization☆145Updated 8 months ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆112Updated 8 months ago
- MMGraphRAG is a multi-modal knowledge graph-based framework designed to enhance complex reasoning tasks, such as multi-modal document que…☆30Updated last week
- Open replication of DeepSeek R1 for text-to-graph extraction.☆99Updated 8 months ago
- [ICLR 2025] The official implementation of paper "ToolGen: Unified Tool Retrieval and Calling via Generation"☆158Updated 6 months ago
- An open platform for enhancing the capability of LLMs in workflow orchestration.☆172Updated 6 months ago