Parsing-free RAG supported by VLMs
☆942Dec 7, 2025Updated 4 months ago
Alternatives and similar repositories for VisRAG
Users that are interested in VisRAG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,586Mar 31, 2026Updated last week
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆91Nov 15, 2024Updated last year
- This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".☆24Oct 28, 2024Updated last year
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆649Jan 11, 2026Updated 2 months ago
- ☆59Oct 18, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆130Nov 6, 2024Updated last year
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆310Oct 18, 2024Updated last year
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆418Apr 22, 2025Updated 11 months ago
- [CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents☆63May 26, 2025Updated 10 months ago
- Empowering RAG with a memory-based data interface for all-purpose applications!☆2,239Sep 11, 2025Updated 6 months ago
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)☆3,445Mar 1, 2026Updated last month
- Multimodal Retrieval-augmented Generation Framework Built by Tongyi Lab, Alibaba Group.☆494Apr 3, 2026Updated last week
- [SIGIR 2026] This is the code repo for the paper "Mixture-of-Retrieval Experts for Reasoning-Guided Multimodal Knowledge Exploitation".☆38Aug 22, 2025Updated 7 months ago
- A simple, easy-to-hack GraphRAG implementation☆3,758Jan 27, 2026Updated 2 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆498Jul 23, 2025Updated 8 months ago
- Retrieval and Retrieval-augmented LLMs☆11,502Apr 1, 2026Updated last week
- Solve Visual Understanding with Reinforced VLMs☆5,925Mar 12, 2026Updated 3 weeks ago
- [EMNLP 2024 Findings] The official PyTorch implementation of EchoSight: Advancing Visual-Language Models with Wiki Knowledge.☆81Jan 19, 2026Updated 2 months ago
- ☆133Apr 7, 2025Updated last year
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,373May 30, 2025Updated 10 months ago
- Code for our paper: "Building A Coding Assistant via Retrieval-Augmented Language Models"☆10Nov 2, 2024Updated last year
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]☆1,196Nov 17, 2025Updated 4 months ago
- A new novel multi-modality (Vision) RAG architecture☆40Oct 1, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications☆70Nov 6, 2024Updated last year
- ☆29Aug 19, 2024Updated last year
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆2,355May 25, 2024Updated last year
- ☆12Jul 13, 2023Updated 2 years ago
- [ACL 2024 Oral] This is the code repo for our ACL‘24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Mo…☆39Jun 30, 2024Updated last year
- ☆38Jan 9, 2026Updated 3 months ago
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆318Aug 8, 2025Updated 8 months ago
- [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge a…☆3,341Sep 4, 2025Updated 7 months ago
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆846Jan 28, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines☆5,476Updated this week
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…☆8,664Jan 28, 2026Updated 2 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,110Feb 10, 2025Updated last year
- MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal search too…☆416Aug 26, 2025Updated 7 months ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆31,971Apr 3, 2026Updated last week
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,652Aug 12, 2024Updated last year
- Recipes for learning, fine-tuning, and adapting ColPali to your multimodal RAG use cases. 👨🏻🍳☆356Jun 2, 2025Updated 10 months ago