A curated list of the latest advancements, papers, tools, and datasets for **Multimodal Retrieval-Augmented Generation (RAG)**. Multimodal RAG integrates information retrieval and generation across multiple data modalities (e.g., text, image, video, audio).
☆51Nov 25, 2025Updated 5 months ago
Alternatives and similar repositories for Awesome-Multimodal-RAG
Users that are interested in Awesome-Multimodal-RAG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Look Back to Reason Forward: Revisitable Memory for Long-Context LLM Agents☆35Apr 13, 2026Updated 3 weeks ago
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆36Nov 18, 2025Updated 5 months ago
- 在RAG技术中,嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务,该服务支持文本和图像的嵌入生成与相似度计算,为多模态信息检索提供了基础能力。☆42Dec 28, 2024Updated last year
- Implementation and evaluation of multimodal RAG with text and image inputs for industrial applications☆70Nov 6, 2024Updated last year
- ☆15Jan 23, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [SIGIR '26] Mixture-of-Retrieval Experts for Reasoning-Guided Multimodal Knowledge Exploitation☆40Apr 8, 2026Updated last month
- Memory experiments with LLMs☆10Mar 31, 2023Updated 3 years ago
- [ACL 2024 Oral] This is the code repo for our ACL‘24 paper "MARVEL: Unlocking the Multi-Modal Capability of Dense Retrieval via Visual Mo…☆39Jun 30, 2024Updated last year
- This repository is intended to take down what I learn from a book named Python3网络爬虫开发实战(第2版).☆11Mar 29, 2023Updated 3 years ago
- 在index-tts-vllm的基础上,实现了并提供了模拟流式合成音频的接口服务及客户端测试脚本☆26Sep 2, 2025Updated 8 months ago
- Face++ 是一款基于 Android 平台开发的创新性 AI 面相分析应用。它巧妙地将中国传统面相学理论(如“三庭五眼”和“十二宫”)与现代人工智能技术相结合,为用户提供一份专业、详尽且富有洞察力的面相分析报告☆22Jul 14, 2025Updated 9 months ago
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)☆16Feb 11, 2023Updated 3 years ago
- Official repository for the paper "Reconstruction of Perceived Images from fMRI Patterns and Semantic Brain Exploration using Instance-Co…☆24May 19, 2022Updated 3 years ago
- Reference implementation of the HEAT algorithm described in https://link.springer.com/chapter/10.1007/978-3-030-62362-3_4☆11Mar 24, 2023Updated 3 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ICLR 2021 (spotlight): Graph Convolution with Low-rank Learnable Local Filters☆16Jan 14, 2021Updated 5 years ago
- [NeurIPS 2025] Search and Refine During Think: Facilitating Knowledge Refinement for Improved Retrieval-Augmented Reasoning☆135Dec 13, 2025Updated 4 months ago
- Happy Hacking With Claude!!!☆25Oct 27, 2025Updated 6 months ago
- 增加了indextts2的简单的界面与api调用方式☆27Oct 27, 2025Updated 6 months ago
- Code for "Neural Network-based Reconstruction in Compressed Sensing MRI Without Fully-sampled Training Data"☆12Jan 5, 2021Updated 5 years ago
- Evaluate state-of-the-art sparse embedding models on the LIMIT dataset (`limit-small` and `limit`) from google's paper `On the Theoretica…☆16Sep 4, 2025Updated 8 months ago
- AI驱动的虚拟数字人直播系统,支持2D/3D数字人、TTS、ASR、唇形同步、推流、互动等模块化开发。☆24May 13, 2025Updated 11 months ago
- ☆19Mar 11, 2025Updated last year
- [ICLR 2025] Vision-Centric Evaluation for Retrieval-Augmented Multimodal Models☆61Jan 22, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Example implementations of Claude's Memory Tool API - Next.js web app and Python CLI for building applications with persistent memory☆53Oct 14, 2025Updated 6 months ago
- ☆19May 4, 2023Updated 3 years ago
- repo for paper: Adaptive Checkpoint Adjoint (ACA) method for gradient estimation in neural ODE☆56Mar 13, 2021Updated 5 years ago
- ☆15Jan 15, 2024Updated 2 years ago
- ppt转数字人后台☆19Apr 9, 2025Updated last year
- Extension of Neural Radiance Feilds (Mildenhall et al 2020) to perform 3D style transfer. Implementation in PyTorch Lightning.☆14Oct 18, 2021Updated 4 years ago
- 一个开源的多模态 AI 搜索项目,结合 大语言模型(LLM)+ 多源搜索引擎 + 多 Agent 架构,打造新一代的智能问答式搜索体验☆17Mar 26, 2025Updated last year
- Code for ISBI'19 Tutorial☆36Jan 6, 2026Updated 4 months ago
- official implementation☆37Jul 29, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆25Sep 1, 2025Updated 8 months ago
- [ACL2026 Findings] "Towards Hierarchical Multi-Step Reward Models for Enhanced Reasoning in Large Language Models"☆20Mar 25, 2025Updated last year
- [CVPR 2022] Official PyTorch implementation for Attributable Visual Similarity Learning☆34Oct 17, 2022Updated 3 years ago
- A curated collection of projects, benchmarks, and research papers focused on reproducing and advancing the DeepSeek R1 framework.☆15Mar 19, 2025Updated last year
- Image reconstruction from human brain activity by VAE and adversarial learning☆12May 21, 2022Updated 3 years ago
- 很多东西无法发布在csdn或者b站,因为涉及黑客技术,过于恐怖,或者涉及ipynb,无法正常展示。我发布到这里,总有允许记录这些内容的地方!☆13Dec 4, 2025Updated 5 months ago
- Personalized Fragrance Recommendation for Aromatherapy: A Machine Learning Approach Based on Personality Traits and Electrodermal Activit…☆10May 1, 2025Updated last year