Parsing-free RAG supported by VLMs
☆956Dec 7, 2025Updated 5 months ago
Alternatives and similar repositories for VisRAG
Users that are interested in VisRAG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.☆2,626May 12, 2026Updated last week
- Official PyTorch Implementation of MLLM Is a Strong Reranker: Advancing Multimodal Retrieval-augmented Generation via Knowledge-enhanced …☆92Nov 15, 2024Updated last year
- This is the code repo for the paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rewards".☆24Oct 28, 2024Updated last year
- [EMNLP 2025] ViDoRAG: Visual Document Retrieval-Augmented Generation via Dynamic Iterative Reasoning Agents☆659Jan 11, 2026Updated 4 months ago
- ☆59Oct 18, 2024Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆129Nov 6, 2024Updated last year
- [EMNLP 2024: Demo Oral] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation☆309Oct 18, 2024Updated last year
- Repo for Benchmarking Multimodal Retrieval Augmented Generation with Dynamic VQA Dataset and Self-adaptive Planning Agent☆423Apr 22, 2025Updated last year
- [CVPR2025] VDocRAG: Retirval-Augmented Generation over Visually-Rich Documents☆67May 26, 2025Updated 11 months ago
- Empowering RAG with a memory-based data interface for all-purpose applications!☆2,242Sep 11, 2025Updated 8 months ago
- ⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)☆3,490Apr 10, 2026Updated last month
- [SIGIR '26] Mixture-of-Retrieval Experts for Reasoning-Guided Multimodal Knowledge Exploitation☆40Updated this week
- A simple, easy-to-hack GraphRAG implementation☆3,845Jan 27, 2026Updated 3 months ago
- Vision-Augmented Retrieval and Generation (VARAG) - Vision first RAG Engine☆500Jul 23, 2025Updated 9 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Multimodal Retrieval-augmented Generation Framework Built by Tongyi Lab, Alibaba Group.☆926Apr 29, 2026Updated 3 weeks ago
- Retrieval and Retrieval-augmented LLMs☆11,686Apr 22, 2026Updated 3 weeks ago
- This is the code repo for our paper "Learning More Effective Representations for Dense Retrieval through Deliberate Thinking Before Searc…☆28Mar 2, 2025Updated last year
- MDocAgent: A Multi-Modal Multi-Agent Framework for Document Understanding☆335Aug 8, 2025Updated 9 months ago
- Solve Visual Understanding with Reinforced VLMs☆5,956Mar 12, 2026Updated 2 months ago
- ☆133Apr 7, 2025Updated last year
- Code for our paper: "Building A Coding Assistant via Retrieval-Augmented Language Models"☆10Nov 2, 2024Updated last year
- mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding☆2,406May 30, 2025Updated 11 months ago
- [EMNLP 2024 Findings] The official PyTorch implementation of EchoSight: Advancing Visual-Language Models with Wiki Knowledge.☆83Jan 19, 2026Updated 4 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A new novel multi-modality (Vision) RAG architecture☆39Oct 1, 2024Updated last year
- 🔍 Search-o1: Agentic Search-Enhanced Large Reasoning Models [EMNLP 2025]☆1,221Nov 17, 2025Updated 6 months ago
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications☆71Nov 6, 2024Updated last year
- ☆29Aug 19, 2024Updated last year
- This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai,…☆2,381May 25, 2024Updated last year
- ☆12Jul 13, 2023Updated 2 years ago
- A Low-Code MCP Framework for Building Complex and Innovative RAG Pipelines☆5,556May 13, 2026Updated last week
- [ACL 2024 Oral] This is the code repo for our ACL‘24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Mo…☆39Jun 30, 2024Updated last year
- ☆41Jan 9, 2026Updated 4 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Use late-interaction multi-modal models such as ColPali in just a few lines of code.☆847Jan 28, 2025Updated last year
- [NeurIPS'24] HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge a…☆3,523Sep 4, 2025Updated 8 months ago
- Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model☆8,120Feb 10, 2025Updated last year
- KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning a…☆8,763Jan 28, 2026Updated 3 months ago
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆33,016May 13, 2026Updated last week
- [ACL-2026] MMSearch-R1 is an end-to-end RL framework that enables LMMs to perform on-demand, multi-turn search with real-world multimodal…☆443Apr 7, 2026Updated last month
- [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.☆24,802Aug 12, 2024Updated last year