xmed-lab/TAM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/xmed-lab/TAM)

xmed-lab / TAM

[ICCV25 Oral] Token Activation Map to Visually Explain Multimodal LLMs

☆189

Alternatives and similar repositories for TAM

Users that are interested in TAM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

saccharomycetes / mllms_know
View on GitHub
[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'
☆380Apr 20, 2025Updated last year
xmed-lab / MedRegA
View on GitHub
[ICLR 2025] MedRegA: Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks
☆46Oct 18, 2025Updated 9 months ago
Sreyan88 / VDGD
View on GitHub
Code for ICLR 2025 Paper: Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs
☆25May 7, 2025Updated last year
seilk / VisAttnSink
View on GitHub
[ICLR 2025] See What You Are Told: Visual Attention Sink in Large Multimodal Models
☆116Feb 16, 2025Updated last year
bytedance / LVLM_Interpretation
View on GitHub
The official repo for "Where do Large Vision-Language Models Look at when Answering Questions?"
☆72Jan 7, 2026Updated 6 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
seilk / LocalizationHeads
View on GitHub
[CVPR 2025 Highlight] Your Large Vision-Language Model Only Needs A Few Attention Heads For Visual Grounding
☆79Aug 31, 2025Updated 10 months ago
zjysteven / VLM-Visualizer
View on GitHub
Visualizing the attention of vision-language models
☆304Feb 28, 2025Updated last year
ZhangqiJiang07 / middle_layers_indicating_hallucinations
View on GitHub
[CVPR 2025] Devils in Middle Layers of Large Vision-Language Models: Interpreting, Detecting and Mitigating Object Hallucinations via Att…
☆84Oct 9, 2025Updated 9 months ago
zhaochen0110 / Awesome_Think_With_Images
View on GitHub
Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual in…
☆1,492Mar 9, 2026Updated 4 months ago
Haochen-Wang409 / TreeVGR
View on GitHub
[ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology
☆92Jan 26, 2026Updated 5 months ago
NK-JittorCV / nk-det
View on GitHub
An open source codebase for object detection based on Jittor
☆19Dec 9, 2025Updated 7 months ago
PKU-ICST-MIPL / DyFo_CVPR2025
View on GitHub
☆115Aug 14, 2025Updated 11 months ago
Cooperx521 / PyramidDrop
View on GitHub
(CVPR 2025) PyramidDrop: Accelerating Your Large Vision-Language Models via Pyramid Visual Redundancy Reduction
☆151Mar 6, 2025Updated last year
xmed-lab / UniEval
View on GitHub
UniEval: Unified Holistic Evaluation for Unified Multimodal Understanding and Generation
☆25May 16, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
xmed-lab / NEURONS
View on GitHub
[ICCV 2025] Neurons: Emulating the Human Visual Cortex Improves Fidelity and Interpretability in fMRI-to-Video Reconstruction
☆29Jul 5, 2026Updated 2 weeks ago
PKU-YuanGroup / Look-Back
View on GitHub
This repository is the official implementation of "Look-Back: Implicit Visual Re-focusing in MLLM Reasoning".
☆99Jul 10, 2025Updated last year
SVT-Yang / MedST
View on GitHub
Offical code of Unlocking the Power of Spatial and Temporal Information in Medical Multimodal Pre-training[ICML 2024]
☆26May 31, 2024Updated 2 years ago
yaolinli / DeCo
View on GitHub
Code for DeCo: Decoupling token compression from semanchc abstraction in multimodal large language models
☆80Jul 14, 2025Updated last year
zifuwan / ONLY
View on GitHub
[ICCV 2025] ONLY: One-Layer Intervention Sufficiently Mitigates Hallucinations in Large Vision-Language Models
☆51Jul 7, 2025Updated last year
xmed-lab / FoPro-KD
View on GitHub
TMI 2023: FoPro-KD: Fourier Prompted Effective Knowledge Distillation for Long-Tailed Medical Image Recognition
☆12Mar 19, 2024Updated 2 years ago
SaraGhazanfari / EMMA
View on GitHub
EMMA [TMLR 2025]
☆14Sep 25, 2025Updated 9 months ago
jungao1106 / ICoT
View on GitHub
[CVPR' 25] Interleaved-Modal Chain-of-Thought
☆112Dec 30, 2025Updated 6 months ago
itsqyh / Awesome-LMMs-Mechanistic-Interpretability
View on GitHub
A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…
☆215Mar 4, 2026Updated 4 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
microsoft / SuperRL
View on GitHub
☆15Sep 8, 2025Updated 10 months ago
HKUSTGZ-ML4Health-Lab / Med-Scout
View on GitHub
Med-Scout: Curing MLLMs' Geometric Blindness in Medical Perception via Geometry-Aware RL Post-Training
☆16Feb 8, 2026Updated 5 months ago
LinjieMu / MMXU
View on GitHub
☆25Nov 27, 2025Updated 7 months ago
UCSC-VLAA / MedVLThinker
View on GitHub
[ML4H'25] MedVLThinker: Simple Baselines for Multimodal Medical Reasoning
☆59Dec 21, 2025Updated 7 months ago
zjunlp / Deco
View on GitHub
[ICLR 2025] MLLM can see? Dynamic Correction Decoding for Hallucination Mitigation
☆146Sep 11, 2025Updated 10 months ago
xmed-lab / CLSS
View on GitHub
The official implementation of GCLSS (Generalized CLSS) and CLSS (NeurIPS 2023: Semi-Supervised Contrastive Learning for Deep Regression …
☆15Apr 11, 2026Updated 3 months ago
CUHK-AIM-Group / MCPL
View on GitHub
MCPL: Multi-modal Collaborative Prompt Learning for Medical Vision-Language Model (Initial Version)
☆13Apr 17, 2024Updated 2 years ago
yaotingwangofficial / Awesome-MCoT
View on GitHub
Multimodal Chain-of-Thought Reasoning: A Comprehensive Survey
☆1,016May 22, 2026Updated last month
Cocofeat / uMedGround
View on GitHub
【IEEE TPAMI 2025】Uncertainty-aware Medical Diagnostic Phrase Identification and Grounding
☆35Jul 9, 2026Updated last week
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
RuoyuChen10 / EAGLE
View on GitHub
[CVPR 2026] Where MLLMs Attend and What They Rely On: Explaining Autoregressive Token Generation
☆44Jun 18, 2026Updated last month
UCSC-VLAA / MedVLSynther
View on GitHub
[ICLR'26] MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs
☆19Nov 1, 2025Updated 8 months ago
mrwu-mac / ControlMLLM
View on GitHub
[NeurIPS2024] Repo for the paper `ControlMLLM: Training-Free Visual Prompt Learning for Multimodal Large Language Models'
☆210Jul 17, 2025Updated last year
Stanford-AIMI / radgraph
View on GitHub
☆78Jul 10, 2026Updated last week
claws-lab / projection-in-MLLMs
View on GitHub
Code and data for ACL 2024 paper on 'Cross-Modal Projection in Multimodal LLMs Doesn't Really Project Visual Attributes to Textual Space'
☆18Jul 21, 2024Updated 2 years ago
LzVv123456 / VISTA
View on GitHub
☆86Jul 28, 2025Updated 11 months ago
MLRM-Halu / MLRM-Halu
View on GitHub
[NeurIPS 2025] More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models
☆82May 31, 2025Updated last year