JarvisUSTC / Awesome-Multimodal-RAGView external linksLinks
A curated list of the latest advancements, papers, tools, and datasets for **Multimodal Retrieval-Augmented Generation (RAG)**. Multimodal RAG integrates information retrieval and generation across multiple data modalities (e.g., text, image, video, audio).
☆48Nov 25, 2025Updated 2 months ago
Alternatives and similar repositories for Awesome-Multimodal-RAG
Users that are interested in Awesome-Multimodal-RAG are comparing it to the libraries listed below
Sorting:
- Official Implementation of LatentSwap3D: Semantic Edits on 3D Image GANs☆23Nov 28, 2023Updated 2 years ago
- An Open-Source RAG Workload Trace to Optimize RAG Serving Systems☆35Nov 18, 2025Updated 2 months ago
- Official repository for the paper "Reconstruction of Perceived Images from fMRI Patterns and Semantic Brain Exploration using Instance-Co…☆24May 19, 2022Updated 3 years ago
- Vecna is a Python chatbot which recommends songs and movies depending upon your feelings☆11Jun 28, 2022Updated 3 years ago
- Code for ISBI'19 Tutorial☆35Jan 6, 2026Updated last month
- MRAugment: physics-aware data augmentation for deep learning based accelerated MRI reconstruction☆31May 5, 2022Updated 3 years ago
- 将Wav2Lip和GFPGAN进行结合实现高清数字人说话视频☆37Jun 2, 2025Updated 8 months ago
- official implementation☆37Jul 29, 2023Updated 2 years ago
- ☆25Sep 1, 2025Updated 5 months ago
- This project predicts wind turbine failure using numerous sensor data by applying classification based ML models that improves prediction…☆11Mar 20, 2023Updated 2 years ago
- ☆10Jul 29, 2022Updated 3 years ago
- Belief Revision based Caption Re-ranker with Visual Semantic Information. COLING 2022☆11Apr 13, 2025Updated 10 months ago
- We archive data because we are interested in the diffs. All data is from https://video-api.cartoonnetwork.com. We run the check every min…☆10Updated this week
- An open source implementation of R1☆29Feb 9, 2026Updated last week
- [CVPR 2022] Official PyTorch implementation for Attributable Visual Similarity Learning☆35Oct 17, 2022Updated 3 years ago
- Extract information from XBRL files in the ESEF format☆13Jan 3, 2026Updated last month
- Tally Prime MCP (Model Context Protocol) Server implementation to feed Tally ERP data to popular LLM like Claude, ChatGPT supporting MCP☆14Nov 11, 2025Updated 3 months ago
- 在RAG技术中,嵌入向量的生成和匹配是关键环节。本文介绍了一种基于CLIP/BLIP模型的嵌入服务,该服务支持文本和图像的嵌入生成与相似度计算,为多模态信息检索提供了基础能力。☆42Dec 28, 2024Updated last year
- 增加了indextts2的简单的界面与api调用方式☆20Oct 27, 2025Updated 3 months ago
- Vector search with Pinecone and Openai to search through contract law textbook. If downloaded, remeber to install all dependencies. Refer…☆13Mar 30, 2023Updated 2 years ago
- Hands-on with popular deep learning datasets and tasks☆13Apr 4, 2023Updated 2 years ago
- Self-Supervised MRI Reconstruction☆10May 25, 2021Updated 4 years ago
- WindTurbineHighSpeedBearingPrognosis-Data☆10Aug 19, 2020Updated 5 years ago
- 该仓库是 BUPT 智能系统实验室的法律大模型项目,基于 ChatGLM 等开源大模型进行实现。☆11Nov 28, 2023Updated 2 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Jul 19, 2022Updated 3 years ago
- generate video with voice narration from ppt/pdf Slides☆10Sep 4, 2023Updated 2 years ago
- free library for clustering and neuro-fuzzy systems☆10Jan 28, 2026Updated 2 weeks ago
- ImageQA is a tool for analyzing digital image quality according to specific attributes such as color, tone transfer, noise or resolution.…☆10Sep 18, 2024Updated last year
- Deepfake faces detection from forged videos where used explainable AI for models' robustness as well as cost sensitive methods for mitiga…☆10May 27, 2024Updated last year
- A gym game for Contra that for reinforcement learning☆10Oct 18, 2021Updated 4 years ago
- A reddit scraping and analysis bot to visualize linguistic and content trends☆12Oct 5, 2021Updated 4 years ago
- EEG-based Major Depression Disorder Recognition using Swin Transformers☆10Jun 23, 2024Updated last year
- Scraping LegiFrance naturalisation decrees for fun and OSINT profit☆11May 27, 2023Updated 2 years ago
- Image reconstruction from human brain activity by VAE and adversarial learning☆12May 21, 2022Updated 3 years ago
- This is a dehazed method for remote sensing image, which based on CycleGAN.☆12May 10, 2022Updated 3 years ago
- AI Manga Editor capable of text recognition, translation, inpainting and editing.☆20Mar 25, 2025Updated 10 months ago
- A simple GPT-3 interface to automate core legal writing tasks☆11Mar 8, 2023Updated 2 years ago
- Documentation and code for predictive maintenance data and assess scripts.☆11Jun 8, 2023Updated 2 years ago
- This Repo contains a fully functional API ready application for delineating fields for smart farming platform☆15Jan 20, 2023Updated 3 years ago