Omaralsaabi / M3DOCRAG
An implementation of "M3DOCRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding" by Jaemin Cho, Debanjan Mahata, Ozan Irsoy, Yujie He, and Mohit Bansal (UNC Chapel Hill & Bloomberg).
☆42 Updated 9 months ago
Alternatives and similar repositories for M3DOCRAG
Users who are interested in M3DOCRAG are comparing it to the repositories listed below.
- (ICCV 2025) OCR Hinders RAG: Evaluating the Cascading Impact of OCR on Retrieval-Augmented Generation ☆85 Updated last month
- ☆42 Updated 2 months ago
- Code for Parametric RAG, SIGIR 2025 Full Paper ☆185 Updated 3 months ago
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding ☆194 Updated this week
- DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents ☆267 Updated last week
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications ☆60 Updated 9 months ago
- This is the code repo for our paper "Benchmarking Retrieval-Augmented Generation in Multi-Modal Contexts". ☆37 Updated last week
- Official repository for RAG-Gym ☆112 Updated 5 months ago
- Dataset and Code for our ACL 2024 paper: "Multimodal Table Understanding". We propose the first large-scale Multimodal IFT and Pre-Train … ☆213 Updated 2 months ago
- The demo, code and data of FollowRAG ☆74 Updated last month
- Implementation of "RAT: Retrieval Augmented Thoughts Elicit Context-Aware Reasoning in Long-Horizon Generation". ☆242 Updated last year
- Awesome-Large-Search-Models is a collection of papers and resources (Methods, Datasets and other resources) about awesome agentic search … ☆113 Updated last month
- The official repository for the paper: Evaluation of Retrieval-Augmented Generation: A Survey. ☆166 Updated 3 months ago
- Official repo for "LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs". ☆235 Updated 11 months ago
- Document Haystacks: Vision-Language Reasoning Over Piles of 1000+ Documents, CVPR 2025 ☆21 Updated 6 months ago
- ☆59 Updated last month
- A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Auto… ☆215 Updated last month
- [EMNLP 2024 Findings] OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs. ☆147 Updated 9 months ago
- ☆186 Updated 4 months ago
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced … ☆79 Updated 8 months ago
- Document Artificial Intelligence ☆184 Updated 3 months ago
- Official Implementation of "Multi-Head RAG: Solving Multi-Aspect Problems with LLMs" ☆223 Updated last month
- [NAACL'24] Dataset, code and models for "TableLlama: Towards Open Large Generalist Models for Tables". ☆130 Updated last year
- The All-in-one Judge Models introduced by Opencompass ☆109 Updated 3 weeks ago
- Tool-Star: Empowering LLM-brained Multi-Tool Reasoner via Reinforcement Learning ☆225 Updated 2 weeks ago
- Meta-Chunking: Learning Efficient Text Segmentation via Logical Perception ☆238 Updated 2 months ago
- Official repository for paper "ReasonIR Training Retrievers for Reasoning Tasks". ☆190 Updated last month
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation ☆303 Updated 9 months ago
- This is the official repository for Auto-RAG. ☆218 Updated 3 weeks ago
- A Survey on Multimodal Retrieval-Augmented Generation ☆291 Updated last week