Omaralsaabi / M3DOCRAG
An implementation of "M3DOCRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding" by Jaemin Cho, Debanjan Mahata, Ozan Irsoy, Yujie He, and Mohit Bansal (UNC Chapel Hill & Bloomberg).
☆48Updated last year
Alternatives and similar repositories for M3DOCRAG
Users who are interested in M3DOCRAG are comparing it to the libraries listed below.
- (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation☆94Updated 2 months ago
- ☆61Updated 8 months ago
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆294Updated 6 months ago
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications☆67Updated last year
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search"☆86Updated 3 months ago
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts".☆43Updated 4 months ago
- MMGraphRAG is a multi-modal knowledge graph-based framework designed to enhance complex reasoning tasks, such as multi-modal document que…☆63Updated 3 weeks ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents☆306Updated 3 months ago
- The demo, code and data of FollowRAG☆75Updated 7 months ago
- [EMNLP 2024] LongRAG: A Dual-perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering☆120Updated last year
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce…☆441Updated 3 weeks ago
- ☆212Updated 10 months ago
- The All-in-one Judge Models introduced by Opencompass☆117Updated 6 months ago
- [CVPR2025] VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents☆58Updated 8 months ago
- Welcome! 😊 This is the official code release of EviNote-RAG, and we’re happy to share it with the community.☆44Updated 2 months ago
- Code for Parametric RAG, SIGIR 2025 Full Paper☆222Updated 9 months ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception☆273Updated 4 months ago
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation".☆247Updated last year
- The code for paper: Decoupled Planning and Execution: A Hierarchical Reasoning Framework for Deep Search☆63Updated 7 months ago
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train …☆224Updated 7 months ago
- ☆34Updated 4 months ago
- Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025☆25Updated last year
- [EMNLP 2024] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs.☆147Updated last year
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆91Updated last year
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆387Updated 5 months ago
- [ACL 2025] An official PyTorch implementation of the paper: Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement☆40Updated 8 months ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations☆120Updated 4 months ago
- This is the code repo for our paper "Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognitio…☆110Updated 4 months ago
- Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation☆59Updated 4 months ago
- UniversalRAG: Retrieval-Augmented Generation over Corpora of Diverse Modalities and Granularities☆157Updated 8 months ago