Parsing-free RAG supported by VLMs
☆935 Dec 7, 2025 Updated 3 months ago
Alternatives and similar repositories for VisRAG
Users that are interested in VisRAG are comparing it to the libraries listed below
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol. ☆2,560 Mar 1, 2026 Updated 2 weeks ago
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced … ☆91 Nov 15, 2024 Updated last year
- This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards". ☆24 Oct 28, 2024 Updated last year
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents ☆641 Jan 11, 2026 Updated 2 months ago
- ☆58 Oct 18, 2024 Updated last year
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines ☆130 Nov 6, 2024 Updated last year
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation ☆310 Oct 18, 2024 Updated last year
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent ☆417 Apr 22, 2025 Updated 10 months ago
- [CVPR2025] VDocRAG: Retrieval-Augmented Generation over Visually-Rich Documents ☆61 May 26, 2025 Updated 9 months ago
- Empowering RAG with a memory-based data interface for all-purpose applications! ☆2,227 Sep 11, 2025 Updated 6 months ago
- This is the code repo for the paper "Learning to Route Queries Across Knowledge Bases for Step-wise Retrieval-Augmented Reasoning". ☆38 Aug 22, 2025 Updated 6 months ago
- Multimodal Retrieval-augmented Generation Framework Built by Tongyi Lab, Alibaba Group. ☆478 Feb 17, 2026 Updated last month
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource) ☆3,405 Mar 1, 2026 Updated 2 weeks ago
- A simple, easy-to-hack GraphRAG implementation ☆3,727 Jan 27, 2026 Updated last month
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine ☆497 Jul 23, 2025 Updated 7 months ago
- Retrieval and Retrieval-augmented LLMs ☆11,410 Mar 10, 2026 Updated last week
- Solve Visual Understanding with Reinforced VLMs ☆5,865 Mar 12, 2026 Updated last week
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025] ☆1,183 Nov 17, 2025 Updated 4 months ago
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding ☆2,375 May 30, 2025 Updated 9 months ago
- [EMNLP 2024 Findings] The official PyTorch implementation of EchoSight: Advancing Visual-Language Models with Wiki Knowledge. ☆81 Jan 19, 2026 Updated 2 months ago
- ☆132 Apr 7, 2025 Updated 11 months ago
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding ☆307 Aug 8, 2025 Updated 7 months ago
- Code for our paper: "Building A Coding Assistant via Retrieval-Augmented Language Models" ☆10 Nov 2, 2024 Updated last year
- A new novel multi-modality (Vision) RAG architecture ☆40 Oct 1, 2024 Updated last year
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications ☆70 Nov 6, 2024 Updated last year
- ☆29 Aug 19, 2024 Updated last year
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,… ☆2,340 May 25, 2024 Updated last year
- ☆13 Jul 13, 2023 Updated 2 years ago
- [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge a… ☆3,283 Sep 4, 2025 Updated 6 months ago
- [ACL 2024 Oral] This is the code repo for our ACL'24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Mo… ☆39 Jun 30, 2024 Updated last year
- ☆38 Jan 9, 2026 Updated 2 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code. ☆844 Jan 28, 2025 Updated last year
- A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines ☆5,459 Mar 13, 2026 Updated last week
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too… ☆409 Aug 26, 2025 Updated 6 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model ☆8,102 Feb 10, 2025 Updated last year
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a… ☆8,623 Jan 28, 2026 Updated last month
- A modular graph-based Retrieval-Augmented Generation (RAG) system ☆31,474 Updated this week
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳 ☆355 Jun 2, 2025 Updated 9 months ago
- [EMNLP2025] "LightRAG: Simple and Fast Retrieval-Augmented Generation" ☆29,469 Updated this week