A curated list of the latest advancements, papers, tools, and datasets for **Multimodal Retrieval-Augmented Generation (RAG)**. Multimodal RAG integrates information retrieval and generation across multiple data modalities (e.g., text, image, video, audio).
☆53Nov 25, 2025Updated 7 months ago
Alternatives and similar repositories for Awesome-Multimodal-RAG
Users that are interested in Awesome-Multimodal-RAG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆42Apr 13, 2026Updated 2 months ago
- ☆24Apr 9, 2025Updated last year
- Working note for WSI analysis☆10Apr 3, 2023Updated 3 years ago
- Official Implementation of LatentSwap3D: Semantic Edits on 3D Image GANs☆23Nov 28, 2023Updated 2 years ago
- ☆15May 7, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- 在RAG技术中,嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务,该服务支持文本和图像的嵌入生成与相似度计算,为多模态信息检索提供了基础能力。☆42Dec 28, 2024Updated last year
- [SIGIR '26] Mixture-of-Retrieval Experts for Reasoning-Guided Multimodal Knowledge Exploitation☆44Jun 13, 2026Updated 2 weeks ago
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆37Nov 18, 2025Updated 7 months ago
- Memory experiments with LLMs☆10Mar 31, 2023Updated 3 years ago
- LONGAGENT: Scaling Language Models to 128k Context through Multi-Agent Collaboration☆11Mar 11, 2024Updated 2 years ago
- [ACL 2024 Oral] This is the code repo for our ACL‘24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Mo…☆39Jun 30, 2024Updated last year
- Official repo for [ICLR 2026] "AnesSuite: A Comprehensive Benchmark and Dataset Suite for Anesthesiology Reasoning in LLMs"☆25Feb 28, 2026Updated 4 months ago
- An open source implementation of R1☆31Updated this week
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆12Jun 18, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Official repository for the paper "Reconstruction of Perceived Images from fMRI Patterns and Semantic Brain Exploration using Instance-Co…☆24May 19, 2022Updated 4 years ago
- ACL 2026 & NAACL 2025: Bridging Retrieval and Inference through Evidence Fusion☆14Apr 9, 2026Updated 2 months ago
- ☆12Jan 10, 2025Updated last year
- The codebase and some introductions of FineMed.☆31Sep 11, 2025Updated 9 months ago
- Just a simple Android app that uses Rokid's CXR-M SDK to upload/sideload an APK onto your Rokid glasses over Wi-Fi. It might be hard to g…☆56Apr 9, 2026Updated 2 months ago
- ☆12Jun 21, 2020Updated 6 years ago
- Happy Hacking With Claude!!!☆25Oct 27, 2025Updated 8 months ago
- Toolkit to help you do better research☆11Apr 19, 2019Updated 7 years ago
- [NeurIPS 2025] Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning☆139Dec 13, 2025Updated 6 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Code for "Neural Network-based Reconstruction in Compressed Sensing MRI Without Fully-sampled Training Data"☆12Jan 5, 2021Updated 5 years ago
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆16Sep 4, 2025Updated 9 months ago
- An LLM-based fuzzing framework for C compilers testing.☆25Dec 14, 2025Updated 6 months ago
- ☆20Mar 11, 2025Updated last year
- ☆18May 4, 2023Updated 3 years ago
- TransLaTeX is a simple tool for translating LaTeX projects using Large Language Models. It can automatically translate LaTeX sources from…☆24Jun 26, 2024Updated 2 years ago
- ☆15Jan 15, 2024Updated 2 years ago
- ppt转数字人后台☆20Apr 9, 2025Updated last year
- Extension of Neural Radiance Feilds (Mildenhall et al 2020) to perform 3D style transfer. Implementation in PyTorch Lightning.☆14Oct 18, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code for ISBI'19 Tutorial☆36Jan 6, 2026Updated 5 months ago
- official implementation☆37Jul 29, 2023Updated 2 years ago
- [ACL2026 Findings] "Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models"☆20Mar 25, 2025Updated last year
- ☆13Dec 22, 2021Updated 4 years ago
- [CVPR 2022] Official PyTorch implementation for Attributable Visual Similarity Learning☆34Oct 17, 2022Updated 3 years ago
- ☆51May 16, 2026Updated last month
- A curated collection of projects, benchmarks, and research papers focused on reproducing and advancing the DeepSeek R1 framework.☆15Mar 19, 2025Updated last year