An implementation of "M3DOCRAG: Multi-modal Retrieval is What You Need for Multi-page Multi-document Understanding" by Jaemin Cho, Debanjan Mahata, Ozan Irsoy, Yujie He, and Mohit Bansal (UNC Chapel Hill & Bloomberg).
☆48Nov 13, 2024Updated last year
Alternatives and similar repositories for M3DOCRAG
Users that are interested in M3DOCRAG are comparing it to the libraries listed below
Sorting:
- ☆61May 19, 2025Updated 9 months ago
- (ICML 2024) Improve Context Understanding in Multimodal Large Language Models via Multimodal Composition Learning☆28Sep 27, 2024Updated last year
- [NeurIPS 2024] Official Code for the Paper "Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning"☆28Apr 8, 2025Updated 10 months ago
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆300Aug 8, 2025Updated 6 months ago
- This is the official repository for Retrieval Augmented Visual Question Answering☆244Dec 19, 2024Updated last year
- GroundCUA☆68Dec 24, 2025Updated 2 months ago
- [ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models☆49Jul 7, 2025Updated 7 months ago
- A library of visualization tools for the interpretability and hallucination analysis of large vision-language models (LVLMs).☆41May 22, 2025Updated 9 months ago
- Simple chatbot created using Rasa☆10Feb 20, 2021Updated 5 years ago
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆632Jan 11, 2026Updated last month
- ☆33Dec 29, 2024Updated last year
- Adds custom tabs to user account☆10Nov 27, 2025Updated 3 months ago
- ☆10Mar 18, 2019Updated 6 years ago
- ☆11Oct 31, 2024Updated last year
- Website nhận diện và trích xuất thông tin từ Chứng Minh Nhân Dân☆11Oct 6, 2022Updated 3 years ago
- [NeurIPS 2025] Official Implementation of paper "Sherlock: Self-Correcting Reasoning in Vision-Language Models"☆28Sep 18, 2025Updated 5 months ago
- Instituto de Telecomunicações Deep Learning-based Point Cloud Codec☆11Jun 18, 2024Updated last year
- ☆13May 13, 2015Updated 10 years ago
- Building a multi-agent RAG system with advanced RAG methods☆12Jan 12, 2025Updated last year
- MPC Server for PySpark inpired by the LakeSail☆17Updated this week
- ☆11Aug 17, 2014Updated 11 years ago
- Code for paper "Rethinking Text-based Protein Understanding: Retrieval or LLM?"☆18Oct 7, 2025Updated 4 months ago
- Reason-before-Retrieve: One-Stage Reflective Chain-of-Thoughts for Training-Free Zero-Shot Composed Image Retrieval [CVPR 2025 Highlight]☆65Jul 8, 2025Updated 7 months ago
- A WhatsApp client library for NodeJS that connects through the WhatsApp Web browser app☆12Jan 15, 2026Updated last month
- A holistic framework for advancing LLMs as data science agents☆33Feb 3, 2026Updated 3 weeks ago
- AWESOME of Tencent Cloud Base 😎☆10Feb 2, 2019Updated 7 years ago
- ☆11May 28, 2023Updated 2 years ago
- PHP class using GD library to work easily with images as layers (like Photoshop or GIMP)☆24Oct 31, 2018Updated 7 years ago
- Pytorch implementation of the StarNet paper algorithm☆10Jan 25, 2022Updated 4 years ago
- Open-source clone of OpenAI's Deep Research. Works with any transformer, gpt4free, & runs in browser. No Firecrawl needed.☆12Jun 12, 2025Updated 8 months ago
- niucloud-admin是一款快速开发SaaS通用管理系统后台框架,【您不需要重复造轮子 – 开发应用便拥有自主版权】! 前端采用最新的技术栈Vite+TypeScript+Vue3+ElementPlus最流行技术架构,后台结合PHP8、Java SDK、Python…☆13Mar 12, 2024Updated last year
- mouse pet-ct image segmentation☆12Feb 19, 2023Updated 3 years ago
- Amazon Connector for odoo☆13Jan 30, 2025Updated last year
- EraX-VL-7B-V1 is the multimodal large language model developed by EraX team, base on Qwen2-VL.☆12Dec 31, 2024Updated last year
- The fully connected neural network implemented in Numpy, from scratch, in Tensorflow and in Keras. The bonus code: Implementation of many…☆12Jul 28, 2017Updated 8 years ago
- A Flask app to document and test Slack's interactive messages.☆10Mar 19, 2021Updated 4 years ago
- Keyphrase Extraction from Scholarly Documents - Thesis☆14Nov 3, 2021Updated 4 years ago
- A simple implementation of an artificial neural network based with Apache Spark and python. this is another implementation of my toy prog…☆11Jul 28, 2017Updated 8 years ago
- Connect to the VestaCP API in your Laravel web application.☆11Oct 1, 2020Updated 5 years ago