JarvisUSTC / Awesome-Multimodal-RAGLinks
A curated list of the latest advancements, papers, tools, and datasets for **Multimodal Retrieval-Augmented Generation (RAG)**. Multimodal RAG integrates information retrieval and generation across multiple data modalities (e.g., text, image, video, audio).
☆22Updated 6 months ago
Alternatives and similar repositories for Awesome-Multimodal-RAG
Users that are interested in Awesome-Multimodal-RAG are comparing it to the libraries listed below
Sorting:
- Unsupervised GRPO☆39Updated last month
- The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C…☆14Updated last year
- [ACL'25] Mosaic-IT: Cost-Free Compositional Data Synthesis for Instruction Tuning☆19Updated 3 weeks ago
- A curated list of awesome LLM Inference-Time Self-Improvement (ITSI, pronounced "itsy") papers from our recent survey: A Survey on Large …☆85Updated 6 months ago
- Official Implementation for EMNLP 2024 (main) "AgentReview: Exploring Academic Peer Review with LLM Agent."☆77Updated 8 months ago
- ☆43Updated 2 months ago
- ☆21Updated 2 months ago
- 超简单复现Deepseek-R1-Zero和Deepseek-R1,以「24点游戏」为例。通过zero-RL、SFT以及SFT+RL,以激发LLM的自主验证反思能力。 About Clean, minimal, accessible reproduction of Dee…☆23Updated 3 months ago
- ROUTE: Robust Multitask Tuning and Collaboration for Text-to-SQL (ICLR 2025 Pytorch Code)☆15Updated 2 months ago
- X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains☆46Updated 2 months ago
- [NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models☆49Updated last month
- Code for "CREAM: Consistency Regularized Self-Rewarding Language Models", ICLR 2025.☆22Updated 5 months ago
- The official Github repository for paper "R^2AG: Incorporating Retrieval Information into Retrieval Augmented Generation" (EMNLP 2024 Fin…☆33Updated 7 months ago
- The official repository for the Scientific Paper Idea Proposer (SciPIP)☆62Updated 4 months ago
- Official repository of paper "Parameters vs. Context: Fine-Grained Control of Knowledge Reliance in Language Models"☆21Updated last month
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model☆46Updated 8 months ago
- ☆22Updated last year
- ☆64Updated last month
- ☆83Updated 6 months ago
- TreeRL: LLM Reinforcement Learning with On-Policy Tree Search in ACL'25☆41Updated last month
- ☆54Updated 4 months ago
- ☆52Updated 5 months ago
- The codebase and some introductions of FineMed.☆23Updated 3 weeks ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433☆26Updated 7 months ago
- survery of small language models☆15Updated 11 months ago
- This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).☆39Updated 9 months ago
- Parameter-Efficient Fine-Tuning for Foundation Models☆75Updated 3 months ago
- This is the code of MMOA-RAG.☆60Updated 2 months ago
- ☆47Updated 5 months ago
- Code for our Paper "All in an Aggregated Image for In-Image Learning"☆30Updated last year