seilk / VisAttnSinkLinks
[ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models
☆69Updated 9 months ago
Alternatives and similar repositories for VisAttnSink
Users that are interested in VisAttnSink are comparing it to the libraries listed below
Sorting:
- [CVPR 2025] Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention☆50Updated last year
- [CVPR 2025 Highlight] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding☆43Updated 2 months ago
- [NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'