thaoshibe / awesome-personalized-lmmsLinks
A curated list of Awesome Personalized Large Multimodal Models resources
☆46Updated 3 weeks ago
Alternatives and similar repositories for awesome-personalized-lmms
Users that are interested in awesome-personalized-lmms are comparing it to the libraries listed below
Sorting:
- [ECCV 2024] API: Attention Prompting on Image for Large Vision-Language Models☆103Updated last year
- [ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation☆116Updated last month
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention☆48Updated last year
- ☆142Updated 8 months ago
- ☆21Updated 5 months ago
- Official Repository of Personalized Visual Instruct Tuning☆32Updated 7 months ago
- [CVPR2024 Highlight] Official implementation for Transferable Visual Prompting. The paper "Exploring the Transferability of Visual Prompt…☆44Updated 9 months ago
- ☆57Updated 2 months ago
- Official repository for CoMM Dataset☆48Updated 9 months ago
- The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?☆38Updated 11 months ago
- [CVPR 2025] RAP: Retrieval-Augmented Personalization☆71Updated 2 months ago
- [ICLR 2025] VL-ICL Bench: The Devil in the Details of Multimodal In-Context Learning☆65Updated 3 weeks ago
- MME-Unify: A Comprehensive Benchmark for Unified Multimodal Understanding and Generation Models☆41Updated 6 months ago
- Doodling our way to AGI ✏️ 🖼️ 🧠☆107Updated 4 months ago
- Hyperbolic Safety-Aware Vision-Language Models. CVPR 2025☆25Updated 6 months ago
- Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs☆21Updated 5 months ago
- [ECCV 2024] Paying More Attention to Image: A Training-Free Method for Alleviating Hallucination in LVLMs☆145Updated 11 months ago
- [ACM Multimedia 2025] This is the official repo for Debiasing Large Visual Language Models, including a Post-Hoc debias method and Visual…☆82Updated 7 months ago
- [ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models☆54Updated 8 months ago
- [NeurIPS 2025 Spotlight] Think or Not Think: A Study of Explicit Thinking in Rule-Based Visual Reinforcement Fine-Tuning☆68Updated last month
- Official repo of M$^2$PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning☆27Updated 6 months ago
- 🌋👵🏻 Yo'LLaVA: Your Personalized Language and Vision Assistant☆115Updated 6 months ago
- ☆20Updated 11 months ago
- HalluciDoctor: Mitigating Hallucinatory Toxicity in Visual Instruction Data (Accepted by CVPR 2024)☆48Updated last year
- ☆11Updated last year
- [CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Att…☆43Updated last week
- Official pytorch implementation of "RITUAL: Random Image Transformations as a Universal Anti-hallucination Lever in Large Vision Language…☆13Updated 10 months ago
- ☆25Updated 3 months ago
- [NeurIPS 2024] Calibrated Self-Rewarding Vision Language Models☆80Updated last year
- Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering☆84Updated 10 months ago