Hoar012/RAP-MLLM

[CVPR 2025] RAP: Retrieval-Augmented Personalization

☆87

Alternatives and similar repositories for RAP-MLLM

Users who are interested in RAP-MLLM are comparing it to the repositories listed below.

Deepayan137 / R2P
Official codebase for the paper "Training-Free Personalization via Retrieval and Reasoning on Fingerprints"
☆25 · Nov 6, 2025 · Updated 8 months ago
Jolieresearch / ICPF
☆14 · Nov 26, 2025 · Updated 8 months ago
thaoshibe / awesome-personalized-lmms
A curated list of Awesome Personalized Large Multimodal Models resources
☆59 · Jun 18, 2026 · Updated last month
ronpay / ExMRD
[WWW 2025] Following Clues, Approaching the Truth: Explainable Micro-Video Rumor Detection via Chain-of-Thought Reasoning
☆25 · Updated this week
ICDM-UESTC / MMRA
MMRA: Predicting Micro-video Popularity via Multi-modal Retrieval Augmentation, ACM SIGIR Conference on Research and Development in Infor…
☆26 · Feb 7, 2026 · Updated 5 months ago
LiRunyi2001 / OmniSSR
Code for paper OmniSSR
☆25 · Apr 21, 2025 · Updated last year
yongliang-wu / NumPro
[CVPR2025] Number it: Temporal Grounding Videos like Flipping Manga
☆150 · Jan 19, 2026 · Updated 6 months ago
PattonYu / CUFAR
☆12 · Feb 24, 2023 · Updated 3 years ago
Xovee / skapp
AAAI '25. Retrieval-Augmented Multimodal Social Media Popularity Prediction
☆24 · Jul 8, 2026 · Updated 2 weeks ago
Hoar012 / TDC-Video
Official implementation of TDC.
☆15 · Jul 22, 2025 · Updated last year
dragonlzm / PAVE
This repo holds the implementation of PAVE: Patching and Adapting Video Large Language Models (CVPR2025)
☆27 · Sep 6, 2025 · Updated 10 months ago
DavidYan2001 / PVChat
[ICCV 2025] PVChat: Personalized Video Chat with One-Shot Learning
☆17 · Apr 4, 2026 · Updated 3 months ago
Jian-Lang / RAGPT
This repo is the official implementation of "Retrieval-Augmented Dynamic Prompt Tuning for Incomplete Multimodal Learning" accepted by AA…
☆67 · May 26, 2026 · Updated 2 months ago
snap-research / MyVLM
Official Implementation for "MyVLM: Personalizing VLMs for User-Specific Queries" (ECCV 2024)
☆188 · Jul 5, 2024 · Updated 2 years ago
WisconsinAIVision / YoLLaVA
🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant (NeurIPS 2024)
☆123 · Mar 26, 2025 · Updated last year
ExplainableML / Vision_by_Language
[ICLR 2024] Official repository for "Vision-by-Language for Training-Free Compositional Image Retrieval"
☆89 · Jul 4, 2024 · Updated 2 years ago
sterzhang / PVIT
Official Repository of Personalized Visual Instruct Tuning
☆34 · Mar 6, 2025 · Updated last year
LunarShen / DsicoVLA
[CVPR 2025] DiscoVLA: Discrepancy Reduction in Vision, Language, and Alignment for Parameter-Efficient Video-Text Retrieval
☆22 · Jun 23, 2025 · Updated last year
UCSB-AI / ComCLIP
Official implementation and dataset for the NAACL 2024 paper "ComCLIP: Training-Free Compositional Image and Text Matching"
☆37 · Aug 18, 2024 · Updated last year
lxd99 / CTCP
Code and data for Continuous-Time Graph Learning for Cascade Popularity Prediction, IJCAI 2023
☆11 · Jul 3, 2023 · Updated 3 years ago
celi52 / STHN
STHN: Simplifying Temporal Heterogeneous Network for Continuous-Time Link Prediction [CIKM 2023]
☆11 · Oct 30, 2023 · Updated 2 years ago
r2llab / GTTA
This codebase is to reproduce the results of the paper "Grounded Test-Time Adaptation for LLM Agents".
☆17 · Mar 4, 2026 · Updated 4 months ago
taolinzhang / BoostAdapter
[NeurIPS2024] BoostAdapter: Improving Test-Time Adaptation via Regional Bootstrapping
☆21 · Feb 28, 2026 · Updated 4 months ago
wistful-8029 / BTP-3DAD
☆18 · Jun 22, 2026 · Updated last month
ZBox1005 / CoT-UQ
[ACL 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"
☆17 · Apr 3, 2025 · Updated last year
ZjjConan / VLM-LwEIB
The official PyTorch implementation of our IJCV-2025 paper "Learning with Enriched Inductive Biases for Vision-Language Models".
☆15 · Jul 6, 2026 · Updated 2 weeks ago
luoyixin2019 / JumpGame
A remake of the "Jump Jump" (跳一跳) mini-game built with Unity
☆10 · Apr 7, 2021 · Updated 5 years ago
SsGood / ADGCN
Pytorch Implementation for paper "Adversarial Graph Disentanglement"
☆13 · Jul 18, 2023 · Updated 3 years ago
Sugewud / Safe-Sora
[NeurIPS 2025] The official implementation of paper "Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking"
☆20 · Oct 10, 2025 · Updated 9 months ago
dvirsamuel / PDM
Code for our paper: "Where's Waldo: Diffusion Features For Personalized Segmentation and Retrieval".
☆14 · Feb 26, 2025 · Updated last year
YucanGuo / RouteRAG
RouteRAG: Efficient Retrieval-Augmented Generation from Text and Graph via Reinforcement Learning
☆36 · Jul 1, 2026 · Updated 3 weeks ago
Disguiser15 / RefTeacher
RefTeacher is a strong baseline method for Semi-Supervised Referring Expression Comprehension.
☆14 · May 26, 2023 · Updated 3 years ago
OVAD-Benchmark / ovad-benchmark-code
OVAD: Open-vocabulary Attribute Detection code
☆30 · Aug 28, 2023 · Updated 2 years ago
hulianyuyy / iLLaVA
iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)
☆23 · Jun 24, 2026 · Updated last month
SEU-VIPGroup / Understanding_Vision_Tasks
☆13 · Feb 2, 2025 · Updated last year
SHI-Labs / Slow-Fast-Video-Multimodal-LLM
☆29 · Apr 8, 2025 · Updated last year
GT-RIPL / DistillMatch-SSCL
PyTorch code for the IJCNN'21 paper: "Memory-Efficient Semi-Supervised Continual Learning: The World is its Own Replay Buffer"
☆14 · Oct 17, 2022 · Updated 3 years ago
whwu95 / FreeVA
FreeVA: Offline MLLM as Training-Free Video Assistant
☆69 · Jun 9, 2024 · Updated 2 years ago
ocbe-uio / imml
A Python package for integrating, processing, and analyzing incomplete multi-modal datasets.
☆26 · Jul 7, 2026 · Updated 2 weeks ago