Omaralsaabi / M3DOCRAG
An implementation of "M3DOCRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding" by Jaemin Cho, Debanjan Mahata, Ozan Irsoy, Yujie He, and Mohit Bansal (UNC Chapel Hill & Bloomberg).
☆47 Updated last year
Alternatives and similar repositories for M3DOCRAG
Users who are interested in M3DOCRAG are comparing it to the repositories listed below:
- (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation ☆94 Updated 5 months ago
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding ☆258 Updated 4 months ago
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts". ☆41 Updated 2 months ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception ☆253 Updated 2 months ago
- Data and Code for EMNLP 2025 Findings Paper "MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search" ☆82 Updated last month
- Repo for "VRAG-RL: Empower Vision-Perception-Based RAG for Visually Rich Information Understanding via Iterative Reasoning with Reinforce… ☆406 Updated last month
- ☆203 Updated 8 months ago
- ☆54 Updated 6 months ago
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications ☆64 Updated last year
- The code and data of DPA-RAG, accepted by WWW 2025 main conference. ☆63 Updated last month
- Code for our paper "RQ-RAG: Learning to Refine Queries for Retrieval Augmented Generation" ☆195 Updated last year
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation ☆310 Updated last year
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train … ☆220 Updated 5 months ago
- The All-in-one Judge Models introduced by Opencompass ☆114 Updated 4 months ago
- Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025 ☆25 Updated 10 months ago
- ☆33 Updated last month
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced … ☆89 Updated last year
- The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey. ☆183 Updated 7 months ago
- StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization ☆152 Updated 10 months ago
- MMGraphRAG is a multi-modal knowledge graph-based framework designed to enhance complex reasoning tasks, such as multi-modal document que… ☆47 Updated 2 weeks ago
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs (ACL 2024) ☆72 Updated 7 months ago
- This is the code repo for our paper "Enhancing Knowledge Integration and Utilization of Large Language Models via Constructivist Cognitio… ☆111 Updated last month
- Code for Parametric RAG, SIGIR 2025 Full Paper ☆210 Updated 7 months ago
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation". ☆249 Updated last year
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever. ☆103 Updated 6 months ago
- Official repository for paper "TableBench: A Comprehensive and Complex Benchmark for Table Question Answering" ☆76 Updated 7 months ago
- Implementation for OAgents: An Empirical Study of Building Effective Agents ☆289 Updated last month
- The demo, code and data of FollowRAG ☆75 Updated 5 months ago
- [NeurIPS 2024] Source code for xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token ☆164 Updated last year
- Open replication of DeepSeek R1 for text-to-graph extraction. ☆99 Updated 10 months ago